WO2022256409A2 - Compositions and methods of modulating rna and protein interactions - Google Patents
Compositions and methods of modulating rna and protein interactions Download PDFInfo
- Publication number
- WO2022256409A2 WO2022256409A2 PCT/US2022/031773 US2022031773W WO2022256409A2 WO 2022256409 A2 WO2022256409 A2 WO 2022256409A2 US 2022031773 W US2022031773 W US 2022031773W WO 2022256409 A2 WO2022256409 A2 WO 2022256409A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- rna
- composition
- nucleic acid
- sars
- protein
- Prior art date
Links
- 239000000203 mixture Substances 0.000 title claims abstract description 68
- 238000000034 method Methods 0.000 title claims abstract description 60
- 230000006916 protein interaction Effects 0.000 title description 2
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 164
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 155
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 155
- 108020004459 Small interfering RNA Proteins 0.000 claims abstract description 62
- 230000014509 gene expression Effects 0.000 claims abstract description 57
- 108091034117 Oligonucleotide Proteins 0.000 claims abstract description 41
- 208000025721 COVID-19 Diseases 0.000 claims abstract description 30
- 239000000074 antisense oligonucleotide Substances 0.000 claims abstract description 25
- 238000012230 antisense oligonucleotides Methods 0.000 claims abstract description 25
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims abstract description 21
- 108090000623 proteins and genes Proteins 0.000 claims description 179
- 101800001554 RNA-directed RNA polymerase Proteins 0.000 claims description 131
- 101800001768 Exoribonuclease Proteins 0.000 claims description 91
- 230000002401 inhibitory effect Effects 0.000 claims description 89
- 102000004169 proteins and genes Human genes 0.000 claims description 82
- 101800000935 Non-structural protein 12 Proteins 0.000 claims description 81
- 101800004575 RNA-directed RNA polymerase nsp12 Proteins 0.000 claims description 81
- 108020004999 messenger RNA Proteins 0.000 claims description 60
- 108091005774 SARS-CoV-2 proteins Proteins 0.000 claims description 58
- 101800000482 Non-structural protein 9 Proteins 0.000 claims description 47
- 241001678559 COVID-19 virus Species 0.000 claims description 36
- 230000000295 complement effect Effects 0.000 claims description 33
- 239000008194 pharmaceutical composition Substances 0.000 claims description 27
- 101710144128 Non-structural protein 2 Proteins 0.000 claims description 24
- 102100022648 Reticulon-2 Human genes 0.000 claims description 24
- 101000992423 Severe acute respiratory syndrome coronavirus 2 Putative ORF9c protein Proteins 0.000 claims description 22
- 238000012986 modification Methods 0.000 claims description 22
- 239000013598 vector Substances 0.000 claims description 22
- 239000002502 liposome Substances 0.000 claims description 21
- 230000004048 modification Effects 0.000 claims description 21
- 230000027455 binding Effects 0.000 claims description 20
- 108020000948 Antisense Oligonucleotides Proteins 0.000 claims description 18
- 102100038284 Cytospin-B Human genes 0.000 claims description 16
- 101710144111 Non-structural protein 3 Proteins 0.000 claims description 15
- 101710144121 Non-structural protein 5 Proteins 0.000 claims description 15
- 102100021798 SH2 domain-containing protein 3C Human genes 0.000 claims description 15
- 101800000578 Uridylate-specific endoribonuclease Proteins 0.000 claims description 15
- 101800000510 Non-structural protein 7 Proteins 0.000 claims description 13
- 101710144118 Non-structural protein 6 Proteins 0.000 claims description 12
- 101800001704 Guanine-N7 methyltransferase Proteins 0.000 claims description 10
- 101800001862 Proofreading exoribonuclease Proteins 0.000 claims description 10
- 101800002929 Proofreading exoribonuclease nsp14 Proteins 0.000 claims description 10
- 101000672024 Homo sapiens UDP-glucose:glycoprotein glucosyltransferase 1 Proteins 0.000 claims description 9
- 208000037847 SARS-CoV-2-infection Diseases 0.000 claims description 9
- 101001086079 Severe acute respiratory syndrome coronavirus 2 Putative ORF3b protein Proteins 0.000 claims description 9
- 102100040363 UDP-glucose:glycoprotein glucosyltransferase 1 Human genes 0.000 claims description 9
- 102100023949 Cytochrome c oxidase subunit NDUFA4 Human genes 0.000 claims description 8
- 101001111225 Homo sapiens Cytochrome c oxidase subunit NDUFA4 Proteins 0.000 claims description 8
- 101800001255 Putative 2'-O-methyl transferase Proteins 0.000 claims description 8
- 101000596375 Severe acute respiratory syndrome coronavirus 2 ORF7b protein Proteins 0.000 claims description 8
- 108091026890 Coding region Proteins 0.000 claims description 7
- 238000006467 substitution reaction Methods 0.000 claims description 6
- 108091023045 Untranslated Region Proteins 0.000 claims description 5
- 101800001927 Uridylate-specific endoribonuclease nsp15 Proteins 0.000 claims description 5
- 239000013603 viral vector Substances 0.000 claims description 5
- 102100040962 26S proteasome non-ATPase regulatory subunit 13 Human genes 0.000 claims description 4
- 108700041152 Endoplasmic Reticulum Chaperone BiP Proteins 0.000 claims description 4
- 102100021451 Endoplasmic reticulum chaperone BiP Human genes 0.000 claims description 4
- 101150112743 HSPA5 gene Proteins 0.000 claims description 4
- 101000612536 Homo sapiens 26S proteasome non-ATPase regulatory subunit 13 Proteins 0.000 claims description 4
- 101001051207 Homo sapiens L-lactate dehydrogenase B chain Proteins 0.000 claims description 4
- 101000946053 Homo sapiens Lysosomal-associated transmembrane protein 4A Proteins 0.000 claims description 4
- 102100024580 L-lactate dehydrogenase B chain Human genes 0.000 claims description 4
- 241000713666 Lentivirus Species 0.000 claims description 4
- 108010009254 Lysosomal-Associated Membrane Protein 1 Proteins 0.000 claims description 4
- 102100034728 Lysosomal-associated transmembrane protein 4A Human genes 0.000 claims description 4
- 102100035133 Lysosome-associated membrane glycoprotein 1 Human genes 0.000 claims description 4
- 230000007423 decrease Effects 0.000 claims description 4
- 241000701161 unidentified adenovirus Species 0.000 claims description 4
- 101150060120 C1qbp gene Proteins 0.000 claims description 3
- 102100037078 Complement component 1 Q subcomponent-binding protein, mitochondrial Human genes 0.000 claims description 3
- 102100039486 Dolichyl-diphosphooligosaccharide-protein glycosyltransferase subunit 4 Human genes 0.000 claims description 3
- 101000609775 Homo sapiens Dolichyl-diphosphooligosaccharide-protein glycosyltransferase subunit 4 Proteins 0.000 claims description 3
- 108091026898 Leader sequence (mRNA) Proteins 0.000 claims description 3
- 108091036066 Three prime untranslated region Proteins 0.000 claims description 3
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical group [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 claims description 2
- 102100032303 26S proteasome non-ATPase regulatory subunit 2 Human genes 0.000 claims 1
- 101000590272 Homo sapiens 26S proteasome non-ATPase regulatory subunit 2 Proteins 0.000 claims 1
- 101000848781 Homo sapiens Dolichyl-diphosphooligosaccharide-protein glycosyltransferase subunit 1 Proteins 0.000 claims 1
- 239000004055 small Interfering RNA Substances 0.000 abstract description 18
- 229920002477 rna polymer Polymers 0.000 description 150
- 210000004027 cell Anatomy 0.000 description 106
- 235000018102 proteins Nutrition 0.000 description 74
- 238000012159 eCLIP Methods 0.000 description 59
- 230000003993 interaction Effects 0.000 description 40
- 101800000509 Non-structural protein 8 Proteins 0.000 description 38
- 125000003729 nucleotide group Chemical group 0.000 description 35
- 239000002773 nucleotide Substances 0.000 description 33
- 108010067390 Viral Proteins Proteins 0.000 description 26
- -1 threose nucleic acids Chemical class 0.000 description 26
- 230000003612 virological effect Effects 0.000 description 24
- 108020004414 DNA Proteins 0.000 description 22
- 102000053602 DNA Human genes 0.000 description 22
- 230000014616 translation Effects 0.000 description 22
- 230000000694 effects Effects 0.000 description 21
- 238000013519 translation Methods 0.000 description 21
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 20
- 108020000999 Viral RNA Proteins 0.000 description 19
- 230000008569 process Effects 0.000 description 19
- 241000700605 Viruses Species 0.000 description 18
- 230000008859 change Effects 0.000 description 16
- 238000009472 formulation Methods 0.000 description 16
- 201000003176 Severe Acute Respiratory Syndrome Diseases 0.000 description 15
- 150000001875 compounds Chemical class 0.000 description 15
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 15
- 108020003589 5' Untranslated Regions Proteins 0.000 description 14
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 14
- 102000040430 polynucleotide Human genes 0.000 description 14
- 108091033319 polynucleotide Proteins 0.000 description 14
- 239000002157 polynucleotide Substances 0.000 description 14
- 230000000692 anti-sense effect Effects 0.000 description 13
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 13
- 230000009385 viral infection Effects 0.000 description 13
- 208000036142 Viral infection Diseases 0.000 description 12
- 201000010099 disease Diseases 0.000 description 12
- 108091093037 Peptide nucleic acid Proteins 0.000 description 11
- 101710172711 Structural protein Proteins 0.000 description 11
- 208000015181 infectious disease Diseases 0.000 description 11
- 210000004072 lung Anatomy 0.000 description 11
- 230000001225 therapeutic effect Effects 0.000 description 11
- 108060001084 Luciferase Proteins 0.000 description 10
- 239000005089 Luciferase Substances 0.000 description 10
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 10
- 108020005345 3' Untranslated Regions Proteins 0.000 description 9
- RYVNIFSIEDRLSJ-UHFFFAOYSA-N 5-(hydroxymethyl)cytosine Chemical compound NC=1NC(=O)N=CC=1CO RYVNIFSIEDRLSJ-UHFFFAOYSA-N 0.000 description 9
- 229930024421 Adenine Natural products 0.000 description 9
- 102000044126 RNA-Binding Proteins Human genes 0.000 description 9
- 108700020471 RNA-Binding Proteins Proteins 0.000 description 9
- 229960000643 adenine Drugs 0.000 description 9
- 125000000217 alkyl group Chemical group 0.000 description 9
- 239000013612 plasmid Substances 0.000 description 9
- 238000013518 transcription Methods 0.000 description 9
- 230000035897 transcription Effects 0.000 description 9
- 108090001074 Nucleocapsid Proteins Proteins 0.000 description 8
- 108020005067 RNA Splice Sites Proteins 0.000 description 8
- 238000000692 Student's t-test Methods 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 8
- 239000001257 hydrogen Substances 0.000 description 8
- 229910052739 hydrogen Inorganic materials 0.000 description 8
- 230000001965 increasing effect Effects 0.000 description 8
- 150000002632 lipids Chemical class 0.000 description 8
- 230000002829 reductive effect Effects 0.000 description 8
- 238000012353 t test Methods 0.000 description 8
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 7
- 101800003471 Helicase Proteins 0.000 description 7
- 238000001276 Kolmogorov–Smirnov test Methods 0.000 description 7
- 101710144127 Non-structural protein 1 Proteins 0.000 description 7
- 102100031776 SH2 domain-containing protein 3A Human genes 0.000 description 7
- 108091081021 Sense strand Proteins 0.000 description 7
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 7
- 239000013543 active substance Substances 0.000 description 7
- 229940104302 cytosine Drugs 0.000 description 7
- 230000001086 cytosolic effect Effects 0.000 description 7
- 239000003814 drug Substances 0.000 description 7
- 238000004519 manufacturing process Methods 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- 208000024891 symptom Diseases 0.000 description 7
- 230000008685 targeting Effects 0.000 description 7
- 241000711573 Coronaviridae Species 0.000 description 6
- 102000004127 Cytokines Human genes 0.000 description 6
- 108090000695 Cytokines Proteins 0.000 description 6
- 101000658071 Homo sapiens Splicing factor U2AF 65 kDa subunit Proteins 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- 238000003556 assay Methods 0.000 description 6
- 210000002220 organoid Anatomy 0.000 description 6
- 238000002360 preparation method Methods 0.000 description 6
- 102000004196 processed proteins & peptides Human genes 0.000 description 6
- 108090000765 processed proteins & peptides Proteins 0.000 description 6
- 230000009467 reduction Effects 0.000 description 6
- 230000010076 replication Effects 0.000 description 6
- 239000000523 sample Substances 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 239000000758 substrate Substances 0.000 description 6
- 229940035893 uracil Drugs 0.000 description 6
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 5
- 102100034583 Dolichyl-diphosphooligosaccharide-protein glycosyltransferase subunit 1 Human genes 0.000 description 5
- 108091028043 Nucleic acid sequence Proteins 0.000 description 5
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 5
- 238000011529 RT qPCR Methods 0.000 description 5
- 102100035040 Splicing factor U2AF 65 kDa subunit Human genes 0.000 description 5
- RYYWUUFWQRZTIU-UHFFFAOYSA-N Thiophosphoric acid Chemical class OP(O)(S)=O RYYWUUFWQRZTIU-UHFFFAOYSA-N 0.000 description 5
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 5
- 102000000852 Tumor Necrosis Factor-alpha Human genes 0.000 description 5
- 239000007859 condensation product Substances 0.000 description 5
- 230000000875 corresponding effect Effects 0.000 description 5
- 235000014113 dietary fatty acids Nutrition 0.000 description 5
- 241001493065 dsRNA viruses Species 0.000 description 5
- 210000002919 epithelial cell Anatomy 0.000 description 5
- 150000002148 esters Chemical class 0.000 description 5
- 239000000194 fatty acid Substances 0.000 description 5
- 229930195729 fatty acid Natural products 0.000 description 5
- 150000004665 fatty acids Chemical class 0.000 description 5
- 238000001476 gene delivery Methods 0.000 description 5
- 230000009368 gene silencing by RNA Effects 0.000 description 5
- 125000000325 methylidene group Chemical group [H]C([H])=* 0.000 description 5
- 239000013642 negative control Substances 0.000 description 5
- 230000037361 pathway Effects 0.000 description 5
- 229920001184 polypeptide Polymers 0.000 description 5
- 239000003755 preservative agent Substances 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 238000000159 protein binding assay Methods 0.000 description 5
- 238000003762 quantitative reverse transcription PCR Methods 0.000 description 5
- 108010074916 ribophorin Proteins 0.000 description 5
- 238000012163 sequencing technique Methods 0.000 description 5
- 229940113082 thymine Drugs 0.000 description 5
- 210000001519 tissue Anatomy 0.000 description 5
- 230000032258 transport Effects 0.000 description 5
- 241000494545 Cordyline virus 2 Species 0.000 description 4
- IAYPIBMASNFSPL-UHFFFAOYSA-N Ethylene oxide Chemical compound C1CO1 IAYPIBMASNFSPL-UHFFFAOYSA-N 0.000 description 4
- 108700024394 Exon Proteins 0.000 description 4
- 101000597417 Homo sapiens Nuclear RNA export factor 1 Proteins 0.000 description 4
- 101000665882 Homo sapiens Retinol-binding protein 4 Proteins 0.000 description 4
- 101710125418 Major capsid protein Proteins 0.000 description 4
- 102100035402 Nuclear RNA export factor 1 Human genes 0.000 description 4
- 150000001408 amides Chemical group 0.000 description 4
- 230000000840 anti-viral effect Effects 0.000 description 4
- 230000000903 blocking effect Effects 0.000 description 4
- 230000001413 cellular effect Effects 0.000 description 4
- 238000009826 distribution Methods 0.000 description 4
- 239000002552 dosage form Substances 0.000 description 4
- 239000000839 emulsion Substances 0.000 description 4
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 230000006870 function Effects 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 238000009396 hybridization Methods 0.000 description 4
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 4
- 230000006698 induction Effects 0.000 description 4
- 238000001990 intravenous administration Methods 0.000 description 4
- 108010026228 mRNA guanylyltransferase Proteins 0.000 description 4
- 238000013507 mapping Methods 0.000 description 4
- 125000004573 morpholin-4-yl group Chemical group N1(CCOCC1)* 0.000 description 4
- 239000002777 nucleoside Substances 0.000 description 4
- 150000003833 nucleoside derivatives Chemical class 0.000 description 4
- 230000002018 overexpression Effects 0.000 description 4
- 230000036961 partial effect Effects 0.000 description 4
- 239000000546 pharmaceutical excipient Substances 0.000 description 4
- 229920001223 polyethylene glycol Polymers 0.000 description 4
- 230000001566 pro-viral effect Effects 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 230000004044 response Effects 0.000 description 4
- 230000000087 stabilizing effect Effects 0.000 description 4
- 238000001890 transfection Methods 0.000 description 4
- 238000011144 upstream manufacturing Methods 0.000 description 4
- ZORQXIQZAOLNGE-UHFFFAOYSA-N 1,1-difluorocyclohexane Chemical compound FC1(F)CCCCC1 ZORQXIQZAOLNGE-UHFFFAOYSA-N 0.000 description 3
- 101800001779 2'-O-methyltransferase Proteins 0.000 description 3
- 101800003073 2'-O-methyltransferase nsp16 Proteins 0.000 description 3
- OVONXEQGWXGFJD-UHFFFAOYSA-N 4-sulfanylidene-1h-pyrimidin-2-one Chemical compound SC=1C=CNC(=O)N=1 OVONXEQGWXGFJD-UHFFFAOYSA-N 0.000 description 3
- MSSXOMSJDRHRMC-UHFFFAOYSA-N 9H-purine-2,6-diamine Chemical compound NC1=NC(N)=C2NC=NC2=N1 MSSXOMSJDRHRMC-UHFFFAOYSA-N 0.000 description 3
- 241000282552 Chlorocebus aethiops Species 0.000 description 3
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 3
- 108090000331 Firefly luciferases Proteins 0.000 description 3
- 101800000355 Helicase nsp10 Proteins 0.000 description 3
- 101001118493 Homo sapiens Nuclear pore glycoprotein p62 Proteins 0.000 description 3
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 3
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 3
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 3
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 3
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 3
- 241000254158 Lampyridae Species 0.000 description 3
- 239000000020 Nitrocellulose Substances 0.000 description 3
- 101800000933 Non-structural protein 10 Proteins 0.000 description 3
- 108090000163 Nuclear pore complex proteins Proteins 0.000 description 3
- 102000003789 Nuclear pore complex proteins Human genes 0.000 description 3
- 102100024057 Nuclear pore glycoprotein p62 Human genes 0.000 description 3
- 101710163270 Nuclease Proteins 0.000 description 3
- 239000004952 Polyamide Substances 0.000 description 3
- 239000002202 Polyethylene glycol Substances 0.000 description 3
- 101710146873 Receptor-binding protein Proteins 0.000 description 3
- 101710137011 Retinol-binding protein 4 Proteins 0.000 description 3
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 3
- 108091006197 SARS-CoV-2 Nucleocapsid Protein Proteins 0.000 description 3
- 102100024544 SURP and G-patch domain-containing protein 1 Human genes 0.000 description 3
- 101000779242 Severe acute respiratory syndrome coronavirus 2 ORF3a protein Proteins 0.000 description 3
- 101000596353 Severe acute respiratory syndrome coronavirus 2 ORF7a protein Proteins 0.000 description 3
- 108091027967 Small hairpin RNA Proteins 0.000 description 3
- 229930006000 Sucrose Natural products 0.000 description 3
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 3
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 150000007513 acids Chemical class 0.000 description 3
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 150000001413 amino acids Chemical group 0.000 description 3
- 239000007900 aqueous suspension Substances 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 239000003086 colorant Substances 0.000 description 3
- 125000000753 cycloalkyl group Chemical group 0.000 description 3
- 208000035475 disorder Diseases 0.000 description 3
- 230000003828 downregulation Effects 0.000 description 3
- 230000009977 dual effect Effects 0.000 description 3
- 239000003995 emulsifying agent Substances 0.000 description 3
- 230000002708 enhancing effect Effects 0.000 description 3
- 239000000796 flavoring agent Substances 0.000 description 3
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 3
- 238000010166 immunofluorescence Methods 0.000 description 3
- 238000001114 immunoprecipitation Methods 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- 230000002452 interceptive effect Effects 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 239000012528 membrane Substances 0.000 description 3
- 239000004005 microsphere Substances 0.000 description 3
- 229920001220 nitrocellulos Polymers 0.000 description 3
- 102000042567 non-coding RNA Human genes 0.000 description 3
- 231100000252 nontoxic Toxicity 0.000 description 3
- 230000003000 nontoxic effect Effects 0.000 description 3
- 230000003285 pharmacodynamic effect Effects 0.000 description 3
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 3
- 229920002647 polyamide Polymers 0.000 description 3
- 238000003752 polymerase chain reaction Methods 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 230000011664 signaling Effects 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 235000002639 sodium chloride Nutrition 0.000 description 3
- 239000002904 solvent Substances 0.000 description 3
- 239000005720 sucrose Substances 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 239000003765 sweetening agent Substances 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 238000011282 treatment Methods 0.000 description 3
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 3
- 241001430294 unidentified retrovirus Species 0.000 description 3
- 239000003981 vehicle Substances 0.000 description 3
- 230000029812 viral genome replication Effects 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- 239000000080 wetting agent Substances 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- PUPZLCDOIYMWBV-UHFFFAOYSA-N (+/-)-1,3-Butanediol Chemical compound CC(O)CCO PUPZLCDOIYMWBV-UHFFFAOYSA-N 0.000 description 2
- FWBHETKCLVMNFS-UHFFFAOYSA-N 4',6-Diamino-2-phenylindol Chemical compound C1=CC(C(=N)N)=CC=C1C1=CC2=CC=C(C(N)=N)C=C2N1 FWBHETKCLVMNFS-UHFFFAOYSA-N 0.000 description 2
- ZLAQATDNGLKIEV-UHFFFAOYSA-N 5-methyl-2-sulfanylidene-1h-pyrimidin-4-one Chemical compound CC1=CNC(=S)NC1=O ZLAQATDNGLKIEV-UHFFFAOYSA-N 0.000 description 2
- LRSASMSXMSNRBT-UHFFFAOYSA-N 5-methylcytosine Chemical compound CC1=CNC(=O)N=C1N LRSASMSXMSNRBT-UHFFFAOYSA-N 0.000 description 2
- PEHVGBZKEYRQSX-UHFFFAOYSA-N 7-deaza-adenine Chemical compound NC1=NC=NC2=C1C=CN2 PEHVGBZKEYRQSX-UHFFFAOYSA-N 0.000 description 2
- LOSIULRWFAEMFL-UHFFFAOYSA-N 7-deazaguanine Chemical compound O=C1NC(N)=NC2=C1CC=N2 LOSIULRWFAEMFL-UHFFFAOYSA-N 0.000 description 2
- HCGHYQLFMPXSDU-UHFFFAOYSA-N 7-methyladenine Chemical compound C1=NC(N)=C2N(C)C=NC2=N1 HCGHYQLFMPXSDU-UHFFFAOYSA-N 0.000 description 2
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- LPXQRXLUHJKZIE-UHFFFAOYSA-N 8-azaguanine Chemical compound NC1=NC(O)=C2NN=NC2=N1 LPXQRXLUHJKZIE-UHFFFAOYSA-N 0.000 description 2
- 229960005508 8-azaguanine Drugs 0.000 description 2
- LRFVTYWOQMYALW-UHFFFAOYSA-N 9H-xanthine Chemical compound O=C1NC(=O)NC2=C1NC=N2 LRFVTYWOQMYALW-UHFFFAOYSA-N 0.000 description 2
- 244000215068 Acacia senegal Species 0.000 description 2
- 235000006491 Acacia senegal Nutrition 0.000 description 2
- 102100034613 Annexin A2 Human genes 0.000 description 2
- 241000416162 Astragalus gummifer Species 0.000 description 2
- 102100032985 CCR4-NOT transcription complex subunit 7 Human genes 0.000 description 2
- 206010013975 Dyspnoeas Diseases 0.000 description 2
- XZWYTXMRWQJBGX-VXBMVYAYSA-N FLAG peptide Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 XZWYTXMRWQJBGX-VXBMVYAYSA-N 0.000 description 2
- 108010020195 FLAG peptide Proteins 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 229920000084 Gum arabic Polymers 0.000 description 2
- 101800002870 Helicase nsp13 Proteins 0.000 description 2
- 101000924474 Homo sapiens Annexin A2 Proteins 0.000 description 2
- 101000942580 Homo sapiens CCR4-NOT transcription complex subunit 7 Proteins 0.000 description 2
- 101000996563 Homo sapiens Nuclear pore complex protein Nup214 Proteins 0.000 description 2
- 101001007901 Homo sapiens Nuclear pore complex protein Nup88 Proteins 0.000 description 2
- 101001121642 Homo sapiens Nucleoporin p54 Proteins 0.000 description 2
- 101000973997 Homo sapiens Nucleosome assembly protein 1-like 4 Proteins 0.000 description 2
- 101000987310 Homo sapiens Serine/threonine-protein kinase PAK 2 Proteins 0.000 description 2
- 101000762128 Homo sapiens Tumor suppressor candidate 3 Proteins 0.000 description 2
- 101000644174 Homo sapiens Uridine phosphorylase 1 Proteins 0.000 description 2
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- 108091054455 MAP kinase family Proteins 0.000 description 2
- 102000043136 MAP kinase family Human genes 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- 101710085938 Matrix protein Proteins 0.000 description 2
- 102000018697 Membrane Proteins Human genes 0.000 description 2
- 101710127721 Membrane protein Proteins 0.000 description 2
- 101710138767 Non-structural glycoprotein 4 Proteins 0.000 description 2
- 101800000932 Non-structural protein 11 Proteins 0.000 description 2
- 101800000934 Non-structural protein 13 Proteins 0.000 description 2
- 102100033819 Nuclear pore complex protein Nup214 Human genes 0.000 description 2
- 102100027586 Nuclear pore complex protein Nup88 Human genes 0.000 description 2
- 102100025453 Nucleoporin p54 Human genes 0.000 description 2
- 101710141454 Nucleoprotein Proteins 0.000 description 2
- 102100022396 Nucleosome assembly protein 1-like 4 Human genes 0.000 description 2
- 239000004677 Nylon Substances 0.000 description 2
- 101710087110 ORF6 protein Proteins 0.000 description 2
- WCUXLLCKKVVCTQ-UHFFFAOYSA-M Potassium chloride Chemical compound [Cl-].[K+] WCUXLLCKKVVCTQ-UHFFFAOYSA-M 0.000 description 2
- 230000004570 RNA-binding Effects 0.000 description 2
- 241000242739 Renilla Species 0.000 description 2
- 108010052090 Renilla Luciferases Proteins 0.000 description 2
- 102100031056 Serine protease 57 Human genes 0.000 description 2
- 101710197596 Serine protease 57 Proteins 0.000 description 2
- 102100027939 Serine/threonine-protein kinase PAK 2 Human genes 0.000 description 2
- 101000992426 Severe acute respiratory syndrome coronavirus 2 ORF9b protein Proteins 0.000 description 2
- 101710193546 Tegument protein VP16 homolog Proteins 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- 229920001615 Tragacanth Polymers 0.000 description 2
- 102100024248 Tumor suppressor candidate 3 Human genes 0.000 description 2
- 101710198378 Uncharacterized 10.8 kDa protein in cox-rep intergenic region Proteins 0.000 description 2
- 101710095001 Uncharacterized protein in nifU 5'region Proteins 0.000 description 2
- 102100020892 Uridine phosphorylase 1 Human genes 0.000 description 2
- 101800001925 Uridylate-specific endoribonuclease nsp11 Proteins 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 238000001790 Welch's t-test Methods 0.000 description 2
- 235000010489 acacia gum Nutrition 0.000 description 2
- DZBUGLKDJFMEHC-UHFFFAOYSA-N acridine Chemical compound C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 2
- 239000004480 active ingredient Substances 0.000 description 2
- 235000001014 amino acid Nutrition 0.000 description 2
- 230000006907 apoptotic process Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 125000003118 aryl group Chemical group 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 239000012876 carrier material Substances 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 125000002091 cationic group Chemical group 0.000 description 2
- 230000034303 cell budding Effects 0.000 description 2
- 230000022131 cell cycle Effects 0.000 description 2
- 210000003855 cell nucleus Anatomy 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical group C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 235000012000 cholesterol Nutrition 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 238000004132 cross linking Methods 0.000 description 2
- 230000001186 cumulative effect Effects 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 230000000368 destabilizing effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 239000003085 diluting agent Substances 0.000 description 2
- 239000002270 dispersing agent Substances 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 239000003937 drug carrier Substances 0.000 description 2
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 235000013355 food flavoring agent Nutrition 0.000 description 2
- 235000003599 food sweetener Nutrition 0.000 description 2
- 238000007306 functionalization reaction Methods 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 239000008187 granular material Substances 0.000 description 2
- 125000001475 halogen functional group Chemical group 0.000 description 2
- 125000005842 heteroatom Chemical group 0.000 description 2
- 125000000623 heterocyclic group Chemical group 0.000 description 2
- 230000006658 host protein synthesis Effects 0.000 description 2
- 210000005260 human cell Anatomy 0.000 description 2
- 230000028993 immune response Effects 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 230000002757 inflammatory effect Effects 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 210000004779 membrane envelope Anatomy 0.000 description 2
- 230000004060 metabolic process Effects 0.000 description 2
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 2
- 230000002438 mitochondrial effect Effects 0.000 description 2
- 108091027963 non-coding RNA Proteins 0.000 description 2
- 239000000346 nonvolatile oil Substances 0.000 description 2
- 210000004492 nuclear pore Anatomy 0.000 description 2
- 229920001778 nylon Polymers 0.000 description 2
- 239000003921 oil Substances 0.000 description 2
- 235000019198 oils Nutrition 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 230000003094 perturbing effect Effects 0.000 description 2
- RDOWQLZANAYVLL-UHFFFAOYSA-N phenanthridine Chemical compound C1=CC=C2C3=CC=CC=C3C=NC2=C1 RDOWQLZANAYVLL-UHFFFAOYSA-N 0.000 description 2
- 150000004713 phosphodiesters Chemical group 0.000 description 2
- 150000003904 phospholipids Chemical class 0.000 description 2
- 229910052698 phosphorus Inorganic materials 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 230000004962 physiological condition Effects 0.000 description 2
- 229920000768 polyamine Polymers 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 2
- 239000000244 polyoxyethylene sorbitan monooleate Substances 0.000 description 2
- 230000001124 posttranscriptional effect Effects 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- QELSKZZBTMNZEB-UHFFFAOYSA-N propylparaben Chemical compound CCCOC(=O)C1=CC=C(O)C=C1 QELSKZZBTMNZEB-UHFFFAOYSA-N 0.000 description 2
- 230000013587 protein N-linked glycosylation Effects 0.000 description 2
- 150000003230 pyrimidines Chemical class 0.000 description 2
- 230000009257 reactivity Effects 0.000 description 2
- 108020004418 ribosomal RNA Proteins 0.000 description 2
- 125000000548 ribosyl group Chemical group C1([C@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 235000011069 sorbitan monooleate Nutrition 0.000 description 2
- 239000001593 sorbitan monooleate Substances 0.000 description 2
- 229940035049 sorbitan monooleate Drugs 0.000 description 2
- 230000009870 specific binding Effects 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 229910052717 sulfur Inorganic materials 0.000 description 2
- 239000000375 suspending agent Substances 0.000 description 2
- 230000009897 systematic effect Effects 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000001052 transient effect Effects 0.000 description 2
- 230000003827 upregulation Effects 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 108700026220 vif Genes Proteins 0.000 description 2
- 230000005727 virus proliferation Effects 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- LNAZSHAWQACDHT-XIYTZBAFSA-N (2r,3r,4s,5r,6s)-4,5-dimethoxy-2-(methoxymethyl)-3-[(2s,3r,4s,5r,6r)-3,4,5-trimethoxy-6-(methoxymethyl)oxan-2-yl]oxy-6-[(2r,3r,4s,5r,6r)-4,5,6-trimethoxy-2-(methoxymethyl)oxan-3-yl]oxyoxane Chemical compound CO[C@@H]1[C@@H](OC)[C@H](OC)[C@@H](COC)O[C@H]1O[C@H]1[C@H](OC)[C@@H](OC)[C@H](O[C@H]2[C@@H]([C@@H](OC)[C@H](OC)O[C@@H]2COC)OC)O[C@@H]1COC LNAZSHAWQACDHT-XIYTZBAFSA-N 0.000 description 1
- LOGFVTREOLYCPF-KXNHARMFSA-N (2s,3r)-2-[[(2r)-1-[(2s)-2,6-diaminohexanoyl]pyrrolidine-2-carbonyl]amino]-3-hydroxybutanoic acid Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-KXNHARMFSA-N 0.000 description 1
- BHQCQFFYRZLCQQ-UHFFFAOYSA-N (3alpha,5alpha,7alpha,12alpha)-3,7,12-trihydroxy-cholan-24-oic acid Natural products OC1CC2CC(O)CCC2(C)C2C1C1CCC(C(CCC(O)=O)C)C1(C)C(O)C2 BHQCQFFYRZLCQQ-UHFFFAOYSA-N 0.000 description 1
- QGVQZRDQPDLHHV-DPAQBDIFSA-N (3s,8s,9s,10r,13r,14s,17r)-10,13-dimethyl-17-[(2r)-6-methylheptan-2-yl]-2,3,4,7,8,9,11,12,14,15,16,17-dodecahydro-1h-cyclopenta[a]phenanthrene-3-thiol Chemical compound C1C=C2C[C@@H](S)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 QGVQZRDQPDLHHV-DPAQBDIFSA-N 0.000 description 1
- WRIDQFICGBMAFQ-UHFFFAOYSA-N (E)-8-Octadecenoic acid Natural products CCCCCCCCCC=CCCCCCCC(O)=O WRIDQFICGBMAFQ-UHFFFAOYSA-N 0.000 description 1
- DNIAPMSPPWPWGF-GSVOUGTGSA-N (R)-(-)-Propylene glycol Chemical compound C[C@@H](O)CO DNIAPMSPPWPWGF-GSVOUGTGSA-N 0.000 description 1
- JLPULHDHAOZNQI-ZTIMHPMXSA-N 1-hexadecanoyl-2-(9Z,12Z-octadecadienoyl)-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCC\C=C/C\C=C/CCCCC JLPULHDHAOZNQI-ZTIMHPMXSA-N 0.000 description 1
- IXPNQXFRVYWDDI-UHFFFAOYSA-N 1-methyl-2,4-dioxo-1,3-diazinane-5-carboximidamide Chemical compound CN1CC(C(N)=N)C(=O)NC1=O IXPNQXFRVYWDDI-UHFFFAOYSA-N 0.000 description 1
- IIZPXYDJLKNOIY-JXPKJXOSSA-N 1-palmitoyl-2-arachidonoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCC\C=C/C\C=C/C\C=C/C\C=C/CCCCC IIZPXYDJLKNOIY-JXPKJXOSSA-N 0.000 description 1
- UHUHBFMZVCOEOV-UHFFFAOYSA-N 1h-imidazo[4,5-c]pyridin-4-amine Chemical compound NC1=NC=CC2=C1N=CN2 UHUHBFMZVCOEOV-UHFFFAOYSA-N 0.000 description 1
- VEPOHXYIFQMVHW-XOZOLZJESA-N 2,3-dihydroxybutanedioic acid (2S,3S)-3,4-dimethyl-2-phenylmorpholine Chemical compound OC(C(O)C(O)=O)C(O)=O.C[C@H]1[C@@H](OCCN1C)c1ccccc1 VEPOHXYIFQMVHW-XOZOLZJESA-N 0.000 description 1
- PIINGYXNCHTJTF-UHFFFAOYSA-N 2-(2-azaniumylethylamino)acetate Chemical group NCCNCC(O)=O PIINGYXNCHTJTF-UHFFFAOYSA-N 0.000 description 1
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 1
- 125000004200 2-methoxyethyl group Chemical group [H]C([H])([H])OC([H])([H])C([H])([H])* 0.000 description 1
- YNFSUOFXEVCDTC-UHFFFAOYSA-N 2-n-methyl-7h-purine-2,6-diamine Chemical compound CNC1=NC(N)=C2NC=NC2=N1 YNFSUOFXEVCDTC-UHFFFAOYSA-N 0.000 description 1
- LQJBNNIYVWPHFW-UHFFFAOYSA-N 20:1omega9c fatty acid Natural products CCCCCCCCCCC=CCCCCCCCC(O)=O LQJBNNIYVWPHFW-UHFFFAOYSA-N 0.000 description 1
- OALHHIHQOFIMEF-UHFFFAOYSA-N 3',6'-dihydroxy-2',4',5',7'-tetraiodo-3h-spiro[2-benzofuran-1,9'-xanthene]-3-one Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC(I)=C(O)C(I)=C1OC1=C(I)C(O)=C(I)C=C21 OALHHIHQOFIMEF-UHFFFAOYSA-N 0.000 description 1
- CYDQOEWLBCCFJZ-UHFFFAOYSA-N 4-(4-fluorophenyl)oxane-4-carboxylic acid Chemical compound C=1C=C(F)C=CC=1C1(C(=O)O)CCOCC1 CYDQOEWLBCCFJZ-UHFFFAOYSA-N 0.000 description 1
- LQLQRFGHAALLLE-UHFFFAOYSA-N 5-bromouracil Chemical compound BrC1=CNC(=O)NC1=O LQLQRFGHAALLLE-UHFFFAOYSA-N 0.000 description 1
- JDBGXEHEIRGOBU-UHFFFAOYSA-N 5-hydroxymethyluracil Chemical compound OCC1=CNC(=O)NC1=O JDBGXEHEIRGOBU-UHFFFAOYSA-N 0.000 description 1
- UJBCLAXPPIDQEE-UHFFFAOYSA-N 5-prop-1-ynyl-1h-pyrimidine-2,4-dione Chemical compound CC#CC1=CNC(=O)NC1=O UJBCLAXPPIDQEE-UHFFFAOYSA-N 0.000 description 1
- KXBCLNRMQPRVTP-UHFFFAOYSA-N 6-amino-1,5-dihydroimidazo[4,5-c]pyridin-4-one Chemical compound O=C1NC(N)=CC2=C1N=CN2 KXBCLNRMQPRVTP-UHFFFAOYSA-N 0.000 description 1
- DCPSTSVLRXOYGS-UHFFFAOYSA-N 6-amino-1h-pyrimidine-2-thione Chemical compound NC1=CC=NC(S)=N1 DCPSTSVLRXOYGS-UHFFFAOYSA-N 0.000 description 1
- CKOMXBHMKXXTNW-UHFFFAOYSA-N 6-methyladenine Chemical compound CNC1=NC=NC2=C1N=CN2 CKOMXBHMKXXTNW-UHFFFAOYSA-N 0.000 description 1
- HRYKDUPGBWLLHO-UHFFFAOYSA-N 8-azaadenine Chemical compound NC1=NC=NC2=NNN=C12 HRYKDUPGBWLLHO-UHFFFAOYSA-N 0.000 description 1
- QSBYPNXLFMSGKH-UHFFFAOYSA-N 9-Heptadecensaeure Natural products CCCCCCCC=CCCCCCCCC(O)=O QSBYPNXLFMSGKH-UHFFFAOYSA-N 0.000 description 1
- 230000002407 ATP formation Effects 0.000 description 1
- 108020005176 AU Rich Elements Proteins 0.000 description 1
- 208000035657 Abasia Diseases 0.000 description 1
- 208000010470 Ageusia Diseases 0.000 description 1
- 241000710929 Alphavirus Species 0.000 description 1
- 102100032187 Androgen receptor Human genes 0.000 description 1
- 102100035765 Angiotensin-converting enzyme 2 Human genes 0.000 description 1
- 108090000975 Angiotensin-converting enzyme 2 Proteins 0.000 description 1
- 206010002653 Anosmia Diseases 0.000 description 1
- 108020004491 Antisense DNA Proteins 0.000 description 1
- 108020005544 Antisense RNA Proteins 0.000 description 1
- 108091023037 Aptamer Proteins 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 108010011485 Aspartame Proteins 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 102100022970 Basic leucine zipper transcriptional factor ATF-like Human genes 0.000 description 1
- 241000008904 Betacoronavirus Species 0.000 description 1
- 102100024506 Bone morphogenetic protein 2 Human genes 0.000 description 1
- 102100036150 C-X-C motif chemokine 5 Human genes 0.000 description 1
- QCMYYKRYFNMIEC-UHFFFAOYSA-N COP(O)=O Chemical class COP(O)=O QCMYYKRYFNMIEC-UHFFFAOYSA-N 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 101710132601 Capsid protein Proteins 0.000 description 1
- 239000004380 Cholic acid Substances 0.000 description 1
- 101710094648 Coat protein Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 102100031673 Corneodesmosin Human genes 0.000 description 1
- 101710139375 Corneodesmosin Proteins 0.000 description 1
- 206010011224 Cough Diseases 0.000 description 1
- 102000000634 Cytochrome c oxidase subunit IV Human genes 0.000 description 1
- 108090000365 Cytochrome-c oxidases Proteins 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- 102000016559 DNA Primase Human genes 0.000 description 1
- 108010092681 DNA Primase Proteins 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 206010012735 Diarrhoea Diseases 0.000 description 1
- 102100031480 Dual specificity mitogen-activated protein kinase kinase 1 Human genes 0.000 description 1
- 208000000059 Dyspnea Diseases 0.000 description 1
- 108050002772 E3 ubiquitin-protein ligase Mdm2 Proteins 0.000 description 1
- 102000012199 E3 ubiquitin-protein ligase Mdm2 Human genes 0.000 description 1
- 108700015856 ELAV-Like Protein 1 Proteins 0.000 description 1
- 102100034235 ELAV-like protein 1 Human genes 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 101150076616 EPHA2 gene Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 101710204837 Envelope small membrane protein Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 102100030340 Ephrin type-A receptor 2 Human genes 0.000 description 1
- 102100038577 F-box/WD repeat-containing protein 11 Human genes 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 1
- 102100031181 Glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 229930186217 Glycolipid Natural products 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 1
- 102100034221 Growth-regulated alpha protein Human genes 0.000 description 1
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 1
- 206010019233 Headaches Diseases 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 102100022823 Histone RNA hairpin-binding protein Human genes 0.000 description 1
- 101000903742 Homo sapiens Basic leucine zipper transcriptional factor ATF-like Proteins 0.000 description 1
- 101000762366 Homo sapiens Bone morphogenetic protein 2 Proteins 0.000 description 1
- 101000947186 Homo sapiens C-X-C motif chemokine 5 Proteins 0.000 description 1
- 101001030696 Homo sapiens F-box/WD repeat-containing protein 11 Proteins 0.000 description 1
- 101001069921 Homo sapiens Growth-regulated alpha protein Proteins 0.000 description 1
- 101000825762 Homo sapiens Histone RNA hairpin-binding protein Proteins 0.000 description 1
- 101001052493 Homo sapiens Mitogen-activated protein kinase 1 Proteins 0.000 description 1
- 101001059984 Homo sapiens Mitogen-activated protein kinase kinase kinase kinase 4 Proteins 0.000 description 1
- 101001109700 Homo sapiens Nuclear receptor subfamily 4 group A member 1 Proteins 0.000 description 1
- 101001121636 Homo sapiens Nucleoporin p58/p45 Proteins 0.000 description 1
- 101000952113 Homo sapiens Probable ATP-dependent RNA helicase DDX5 Proteins 0.000 description 1
- 101000665452 Homo sapiens RNA binding protein fox-1 homolog 2 Proteins 0.000 description 1
- 101000893689 Homo sapiens Ras GTPase-activating protein-binding protein 1 Proteins 0.000 description 1
- 101000644537 Homo sapiens Sequestosome-1 Proteins 0.000 description 1
- 101001001645 Homo sapiens Serine/threonine-protein kinase pim-3 Proteins 0.000 description 1
- 101000808799 Homo sapiens Splicing factor U2AF 35 kDa subunit Proteins 0.000 description 1
- 101000962469 Homo sapiens Transcription factor MafF Proteins 0.000 description 1
- 101000976373 Homo sapiens YTH domain-containing protein 1 Proteins 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- 101710203526 Integrase Proteins 0.000 description 1
- 102000006992 Interferon-alpha Human genes 0.000 description 1
- 108010047761 Interferon-alpha Proteins 0.000 description 1
- 102000003996 Interferon-beta Human genes 0.000 description 1
- 108090000467 Interferon-beta Proteins 0.000 description 1
- 102000008070 Interferon-gamma Human genes 0.000 description 1
- 108010074328 Interferon-gamma Proteins 0.000 description 1
- 102000003777 Interleukin-1 beta Human genes 0.000 description 1
- 108090000193 Interleukin-1 beta Proteins 0.000 description 1
- 102000004125 Interleukin-1alpha Human genes 0.000 description 1
- 108010082786 Interleukin-1alpha Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 102000004895 Lipoproteins Human genes 0.000 description 1
- 108090001030 Lipoproteins Proteins 0.000 description 1
- 108091007460 Long intergenic noncoding RNA Proteins 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 101710145006 Lysis protein Proteins 0.000 description 1
- 108091007767 MALAT1 Proteins 0.000 description 1
- 108010068342 MAP Kinase Kinase 1 Proteins 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- 108060004795 Methyltransferase Proteins 0.000 description 1
- 102100024193 Mitogen-activated protein kinase 1 Human genes 0.000 description 1
- 102100028194 Mitogen-activated protein kinase kinase kinase kinase 4 Human genes 0.000 description 1
- 208000000112 Myalgia Diseases 0.000 description 1
- HSHXDCVZWHOWCS-UHFFFAOYSA-N N'-hexadecylthiophene-2-carbohydrazide Chemical compound CCCCCCCCCCCCCCCCNNC(=O)c1cccs1 HSHXDCVZWHOWCS-UHFFFAOYSA-N 0.000 description 1
- BKAYIFDRRZZKNF-VIFPVBQESA-N N-acetylcarnosine Chemical compound CC(=O)NCCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BKAYIFDRRZZKNF-VIFPVBQESA-N 0.000 description 1
- 102000003945 NF-kappa B Human genes 0.000 description 1
- 108010057466 NF-kappa B Proteins 0.000 description 1
- 206010028813 Nausea Diseases 0.000 description 1
- 108020003217 Nuclear RNA Proteins 0.000 description 1
- 102000043141 Nuclear RNA Human genes 0.000 description 1
- 102000008731 Nuclear RNA export factor Human genes 0.000 description 1
- 108050000506 Nuclear RNA export factor Proteins 0.000 description 1
- 102100022679 Nuclear receptor subfamily 4 group A member 1 Human genes 0.000 description 1
- 102100025794 Nucleoporin p58/p45 Human genes 0.000 description 1
- 229910004679 ONO2 Inorganic materials 0.000 description 1
- REYJJPSVUYRZGE-UHFFFAOYSA-N Octadecylamine Chemical compound CCCCCCCCCCCCCCCCCCN REYJJPSVUYRZGE-UHFFFAOYSA-N 0.000 description 1
- 239000005642 Oleic acid Substances 0.000 description 1
- ZQPPMHVWECSIRJ-UHFFFAOYSA-N Oleic acid Natural products CCCCCCCCC=CCCCCCCCC(O)=O ZQPPMHVWECSIRJ-UHFFFAOYSA-N 0.000 description 1
- 206010068319 Oropharyngeal pain Diseases 0.000 description 1
- 239000012826 P38 inhibitor Substances 0.000 description 1
- 108010047613 PTB-Associated Splicing Factor Proteins 0.000 description 1
- 239000005662 Paraffin oil Substances 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 201000007100 Pharyngitis Diseases 0.000 description 1
- PCNDJXKNXGMECE-UHFFFAOYSA-N Phenazine Natural products C1=CC=CC2=NC3=CC=CC=C3N=C21 PCNDJXKNXGMECE-UHFFFAOYSA-N 0.000 description 1
- 241000283966 Pholidota <mammal> Species 0.000 description 1
- ABLZXFCXXLZCGV-UHFFFAOYSA-N Phosphorous acid Chemical class OP(O)=O ABLZXFCXXLZCGV-UHFFFAOYSA-N 0.000 description 1
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 1
- 206010035664 Pneumonia Diseases 0.000 description 1
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 1
- 102100037434 Probable ATP-dependent RNA helicase DDX5 Human genes 0.000 description 1
- 101710083689 Probable capsid protein Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 206010037660 Pyrexia Diseases 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 208000009341 RNA Virus Infections Diseases 0.000 description 1
- 238000002123 RNA extraction Methods 0.000 description 1
- 230000026279 RNA modification Effects 0.000 description 1
- 101710118046 RNA-directed RNA polymerase Proteins 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- MUPFEKGTMRGPLJ-RMMQSMQOSA-N Raffinose Natural products O(C[C@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@@H](O[C@@]2(CO)[C@H](O)[C@@H](O)[C@@H](CO)O2)O1)[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 MUPFEKGTMRGPLJ-RMMQSMQOSA-N 0.000 description 1
- 102100040854 Ras GTPase-activating protein-binding protein 1 Human genes 0.000 description 1
- 208000036071 Rhinorrhea Diseases 0.000 description 1
- 206010039101 Rhinorrhoea Diseases 0.000 description 1
- 102000004389 Ribonucleoproteins Human genes 0.000 description 1
- 108010081734 Ribonucleoproteins Proteins 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 102100020814 Sequestosome-1 Human genes 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 102100036119 Serine/threonine-protein kinase pim-3 Human genes 0.000 description 1
- 101500025266 Severe acute respiratory syndrome coronavirus 2 2'-O-methyltransferase nsp16 Proteins 0.000 description 1
- 101000953880 Severe acute respiratory syndrome coronavirus 2 Membrane protein Proteins 0.000 description 1
- 101500023562 Severe acute respiratory syndrome coronavirus 2 Non-structural protein 2 Proteins 0.000 description 1
- 101500023577 Severe acute respiratory syndrome coronavirus 2 Non-structural protein 7 Proteins 0.000 description 1
- 101500023576 Severe acute respiratory syndrome coronavirus 2 Non-structural protein 8 Proteins 0.000 description 1
- 101500023575 Severe acute respiratory syndrome coronavirus 2 Non-structural protein 9 Proteins 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 1
- PRXRUNOAOLTIEF-ADSICKODSA-N Sorbitan trioleate Chemical class CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@@H](OC(=O)CCCCCCC\C=C/CCCCCCCC)[C@H]1OC[C@H](O)[C@H]1OC(=O)CCCCCCC\C=C/CCCCCCCC PRXRUNOAOLTIEF-ADSICKODSA-N 0.000 description 1
- 102100038501 Splicing factor U2AF 35 kDa subunit Human genes 0.000 description 1
- 102100027780 Splicing factor, proline- and glutamine-rich Human genes 0.000 description 1
- UCKMPCXJQFINFW-UHFFFAOYSA-N Sulphide Chemical compound [S-2] UCKMPCXJQFINFW-UHFFFAOYSA-N 0.000 description 1
- 102100039187 Transcription factor MafF Human genes 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- 108090000704 Tubulin Proteins 0.000 description 1
- 102000004243 Tubulin Human genes 0.000 description 1
- 108091026838 U1 spliceosomal RNA Proteins 0.000 description 1
- 108091026828 U2 spliceosomal RNA Proteins 0.000 description 1
- MUPFEKGTMRGPLJ-UHFFFAOYSA-N UNPD196149 Natural products OC1C(O)C(CO)OC1(CO)OC1C(O)C(O)C(O)C(COC2C(C(O)C(O)C(CO)O2)O)O1 MUPFEKGTMRGPLJ-UHFFFAOYSA-N 0.000 description 1
- 108010003533 Viral Envelope Proteins Proteins 0.000 description 1
- 108700005077 Viral Genes Proteins 0.000 description 1
- 206010047700 Vomiting Diseases 0.000 description 1
- 102100023905 YTH domain-containing protein 1 Human genes 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- WERKSKAQRVDLDW-ANOHMWSOSA-N [(2s,3r,4r,5r)-2,3,4,5,6-pentahydroxyhexyl] (z)-octadec-9-enoate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO WERKSKAQRVDLDW-ANOHMWSOSA-N 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- DPXJVFZANSGRMM-UHFFFAOYSA-N acetic acid;2,3,4,5,6-pentahydroxyhexanal;sodium Chemical compound [Na].CC(O)=O.OCC(O)C(O)C(O)C(O)C=O DPXJVFZANSGRMM-UHFFFAOYSA-N 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 235000019666 ageusia Nutrition 0.000 description 1
- 230000004931 aggregating effect Effects 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- 150000001336 alkenes Chemical class 0.000 description 1
- 125000005083 alkoxyalkoxy group Chemical group 0.000 description 1
- 125000002877 alkyl aryl group Chemical group 0.000 description 1
- 125000005600 alkyl phosphonate group Chemical group 0.000 description 1
- 125000002947 alkylene group Chemical group 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- 229940024606 amino acid Drugs 0.000 description 1
- 125000005122 aminoalkylamino group Chemical group 0.000 description 1
- 108010080146 androgen receptors Proteins 0.000 description 1
- PYKYMHQGRFAEBM-UHFFFAOYSA-N anthraquinone Natural products CCC(=O)c1c(O)c2C(=O)C3C(C=CC=C3O)C(=O)c2cc1CC(=O)OC PYKYMHQGRFAEBM-UHFFFAOYSA-N 0.000 description 1
- 150000004056 anthraquinones Chemical class 0.000 description 1
- 230000002424 anti-apoptotic effect Effects 0.000 description 1
- 230000003466 anti-cipated effect Effects 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 239000003816 antisense DNA Substances 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 125000003710 aryl alkyl group Chemical group 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 125000000613 asparagine group Chemical group N[C@@H](CC(N)=O)C(=O)* 0.000 description 1
- 239000000605 aspartame Substances 0.000 description 1
- IAOZJIPTCAWIRG-QWRGUYRKSA-N aspartame Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)OC)CC1=CC=CC=C1 IAOZJIPTCAWIRG-QWRGUYRKSA-N 0.000 description 1
- 235000010357 aspartame Nutrition 0.000 description 1
- 229960003438 aspartame Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000037429 base substitution Effects 0.000 description 1
- 238000002869 basic local alignment search tool Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- ZYGHJZDHTFUPRJ-UHFFFAOYSA-N benzo-alpha-pyrone Natural products C1=CC=C2OC(=O)C=CC2=C1 ZYGHJZDHTFUPRJ-UHFFFAOYSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 229920000249 biocompatible polymer Polymers 0.000 description 1
- 230000008436 biogenesis Effects 0.000 description 1
- 239000012472 biological sample Substances 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 210000004204 blood vessel Anatomy 0.000 description 1
- 208000027499 body ache Diseases 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- 239000006172 buffering agent Substances 0.000 description 1
- 239000004067 bulking agent Substances 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 235000011148 calcium chloride Nutrition 0.000 description 1
- 239000001768 carboxy methyl cellulose Substances 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000004663 cell proliferation Effects 0.000 description 1
- 230000003833 cell viability Effects 0.000 description 1
- 230000033077 cellular process Effects 0.000 description 1
- 230000004700 cellular uptake Effects 0.000 description 1
- 150000001841 cholesterols Chemical class 0.000 description 1
- 235000019416 cholic acid Nutrition 0.000 description 1
- BHQCQFFYRZLCQQ-OELDTZBJSA-N cholic acid Chemical compound C([C@H]1C[C@H]2O)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(O)=O)C)[C@@]2(C)[C@@H](O)C1 BHQCQFFYRZLCQQ-OELDTZBJSA-N 0.000 description 1
- 229960002471 cholic acid Drugs 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000005352 clarification Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 238000013270 controlled release Methods 0.000 description 1
- 235000001671 coumarin Nutrition 0.000 description 1
- 125000000332 coumarinyl group Chemical class O1C(=O)C(=CC2=CC=CC=C12)* 0.000 description 1
- 125000001995 cyclobutyl group Chemical group [H]C1([H])C([H])([H])C([H])(*)C1([H])[H] 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000012350 deep sequencing Methods 0.000 description 1
- 230000004665 defense response Effects 0.000 description 1
- 230000005860 defense response to virus Effects 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- KXGVEGMKQFWNSR-UHFFFAOYSA-N deoxycholic acid Natural products C1CC2CC(O)CCC2(C)C2C1C1CCC(C(CCC(O)=O)C)C1(C)C(O)C2 KXGVEGMKQFWNSR-UHFFFAOYSA-N 0.000 description 1
- 239000005549 deoxyribonucleoside Substances 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000010494 dissociation reaction Methods 0.000 description 1
- 230000005593 dissociations Effects 0.000 description 1
- NAGJZTKCGNOGPW-UHFFFAOYSA-N dithiophosphoric acid Chemical class OP(O)(S)=S NAGJZTKCGNOGPW-UHFFFAOYSA-N 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 239000003974 emollient agent Substances 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- LYCAIKOWRPUZTN-UHFFFAOYSA-N ethylene glycol Natural products OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 1
- 230000029142 excretion Effects 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 206010016256 fatigue Diseases 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 229940014144 folate Drugs 0.000 description 1
- OVBPIULPVIDEAO-LBPRGKRZSA-N folic acid Chemical compound C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-LBPRGKRZSA-N 0.000 description 1
- 235000019152 folic acid Nutrition 0.000 description 1
- 239000011724 folic acid Substances 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 238000001415 gene therapy Methods 0.000 description 1
- 108091006104 gene-regulatory proteins Proteins 0.000 description 1
- 102000034356 gene-regulatory proteins Human genes 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 235000004554 glutamine Nutrition 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 1
- 125000003827 glycol group Chemical group 0.000 description 1
- 231100000869 headache Toxicity 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 125000000592 heterocycloalkyl group Chemical group 0.000 description 1
- FBPFZTCFMRRESA-UHFFFAOYSA-N hexane-1,2,3,4,5,6-hexol Chemical compound OCC(O)C(O)C(O)C(O)CO FBPFZTCFMRRESA-UHFFFAOYSA-N 0.000 description 1
- 229920001477 hydrophilic polymer Polymers 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- 235000010979 hydroxypropyl methyl cellulose Nutrition 0.000 description 1
- 239000001866 hydroxypropyl methyl cellulose Substances 0.000 description 1
- 229920003088 hydroxypropyl methyl cellulose Polymers 0.000 description 1
- 210000002865 immune cell Anatomy 0.000 description 1
- 238000010820 immunofluorescence microscopy Methods 0.000 description 1
- 238000003125 immunofluorescent labeling Methods 0.000 description 1
- 239000007943 implant Substances 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000034184 interaction with host Effects 0.000 description 1
- 239000000138 intercalating agent Substances 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- QXJSBBXBKPUZAA-UHFFFAOYSA-N isooleic acid Natural products CCCCCCCC=CCCCCCCCCC(O)=O QXJSBBXBKPUZAA-UHFFFAOYSA-N 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 235000010445 lecithin Nutrition 0.000 description 1
- 239000000787 lecithin Substances 0.000 description 1
- 229940067606 lecithin Drugs 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- DTOSIQBPPRVQHS-PDBXOOCHSA-M linolenate Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCC([O-])=O DTOSIQBPPRVQHS-PDBXOOCHSA-M 0.000 description 1
- 239000008206 lipophilic material Substances 0.000 description 1
- 229920006008 lipopolysaccharide Polymers 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 125000001921 locked nucleotide group Chemical group 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 238000003670 luciferase enzyme activity assay Methods 0.000 description 1
- 210000005265 lung cell Anatomy 0.000 description 1
- 239000012931 lyophilized formulation Substances 0.000 description 1
- 239000008176 lyophilized powder Substances 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 230000012976 mRNA stabilization Effects 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 239000002923 metal particle Substances 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 125000000956 methoxy group Chemical group [H]C([H])([H])O* 0.000 description 1
- 229920000609 methyl cellulose Polymers 0.000 description 1
- 235000010981 methylcellulose Nutrition 0.000 description 1
- 239000001923 methylcellulose Substances 0.000 description 1
- 239000000693 micelle Substances 0.000 description 1
- 239000002480 mineral oil Substances 0.000 description 1
- 235000010446 mineral oil Nutrition 0.000 description 1
- 230000005787 mitochondrial ATP synthesis coupled electron transport Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 230000004879 molecular function Effects 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 238000011201 multiple comparisons test Methods 0.000 description 1
- 238000002887 multiple sequence alignment Methods 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- GZCNJTFELNTSAB-UHFFFAOYSA-N n'-(7h-purin-6-yl)hexane-1,6-diamine Chemical compound NCCCCCCNC1=NC=NC2=C1NC=N2 GZCNJTFELNTSAB-UHFFFAOYSA-N 0.000 description 1
- 229920005615 natural polymer Polymers 0.000 description 1
- 230000008693 nausea Effects 0.000 description 1
- 230000004134 neutrophil mediated immunity Effects 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 125000001893 nitrooxy group Chemical group [O-][N+](=O)O* 0.000 description 1
- 230000009871 nonspecific binding Effects 0.000 description 1
- 230000030147 nuclear export Effects 0.000 description 1
- 238000001668 nucleic acid synthesis Methods 0.000 description 1
- 125000003835 nucleoside group Chemical group 0.000 description 1
- GYCKQBWUSACYIF-UHFFFAOYSA-N o-hydroxybenzoic acid ethyl ester Natural products CCOC(=O)C1=CC=CC=C1O GYCKQBWUSACYIF-UHFFFAOYSA-N 0.000 description 1
- 230000009437 off-target effect Effects 0.000 description 1
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 1
- 229940046166 oligodeoxynucleotide Drugs 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 229920001542 oligosaccharide Polymers 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 125000001181 organosilyl group Chemical group [SiH3]* 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 108010068338 p38 Mitogen-Activated Protein Kinases Proteins 0.000 description 1
- 239000003002 pH adjusting agent Substances 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 125000000913 palmityl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- ONTNXMBMXUNDBF-UHFFFAOYSA-N pentatriacontane-17,18,19-triol Chemical compound CCCCCCCCCCCCCCCCC(O)C(O)C(O)CCCCCCCCCCCCCCCC ONTNXMBMXUNDBF-UHFFFAOYSA-N 0.000 description 1
- 239000002304 perfume Substances 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- 150000008298 phosphoramidates Chemical class 0.000 description 1
- 125000004437 phosphorous atom Chemical group 0.000 description 1
- 239000011574 phosphorus Substances 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 230000018127 platelet degranulation Effects 0.000 description 1
- 229920000570 polyether Polymers 0.000 description 1
- 210000002729 polyribosome Anatomy 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 1
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 239000001103 potassium chloride Substances 0.000 description 1
- 235000011164 potassium chloride Nutrition 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 230000002335 preservative effect Effects 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000000069 prophylactic effect Effects 0.000 description 1
- 230000009145 protein modification Effects 0.000 description 1
- 230000020978 protein processing Effects 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 230000019866 protein targeting to membrane Effects 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- MUPFEKGTMRGPLJ-ZQSKZDJDSA-N raffinose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO[C@@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)O1 MUPFEKGTMRGPLJ-ZQSKZDJDSA-N 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000009711 regulatory function Effects 0.000 description 1
- 125000006853 reporter group Chemical group 0.000 description 1
- 230000027756 respiratory electron transport chain Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 239000002342 ribonucleoside Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- CVHZOJJKTDOEJC-UHFFFAOYSA-N saccharin Chemical compound C1=CC=C2C(=O)NS(=O)(=O)C2=C1 CVHZOJJKTDOEJC-UHFFFAOYSA-N 0.000 description 1
- 235000019204 saccharin Nutrition 0.000 description 1
- 229940081974 saccharin Drugs 0.000 description 1
- 239000000901 saccharin and its Na,K and Ca salt Substances 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 208000013220 shortness of breath Diseases 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 239000002924 silencing RNA Substances 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 235000010413 sodium alginate Nutrition 0.000 description 1
- 239000000661 sodium alginate Substances 0.000 description 1
- 229940005550 sodium alginate Drugs 0.000 description 1
- 235000019812 sodium carboxymethyl cellulose Nutrition 0.000 description 1
- 229920001027 sodium carboxymethylcellulose Polymers 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 239000001540 sodium lactate Substances 0.000 description 1
- 229940005581 sodium lactate Drugs 0.000 description 1
- 235000011088 sodium lactate Nutrition 0.000 description 1
- 235000019333 sodium laurylsulphate Nutrition 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 229940083466 soybean lecithin Drugs 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 150000003431 steroids Chemical class 0.000 description 1
- 230000005531 stress granule assembly Effects 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 229960004793 sucrose Drugs 0.000 description 1
- IIACRCGMVDHOTQ-UHFFFAOYSA-N sulfamic acid Chemical group NS(O)(=O)=O IIACRCGMVDHOTQ-UHFFFAOYSA-N 0.000 description 1
- 150000003456 sulfonamides Chemical group 0.000 description 1
- BDHFUVZGWQCTTF-UHFFFAOYSA-M sulfonate Chemical compound [O-]S(=O)=O BDHFUVZGWQCTTF-UHFFFAOYSA-M 0.000 description 1
- 150000003457 sulfones Chemical group 0.000 description 1
- 150000003462 sulfoxides Chemical class 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 229920001059 synthetic polymer Polymers 0.000 description 1
- 235000020357 syrup Nutrition 0.000 description 1
- 239000006188 syrup Substances 0.000 description 1
- 150000003568 thioethers Chemical class 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 230000002463 transducing effect Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 238000011269 treatment regimen Methods 0.000 description 1
- ZMANZCXQSJIPKH-UHFFFAOYSA-O triethylammonium ion Chemical compound CC[NH+](CC)CC ZMANZCXQSJIPKH-UHFFFAOYSA-O 0.000 description 1
- 125000000876 trifluoromethoxy group Chemical group FC(F)(F)O* 0.000 description 1
- 125000002023 trifluoromethyl group Chemical group FC(F)(F)* 0.000 description 1
- 230000007306 turnover Effects 0.000 description 1
- 238000009281 ultraviolet germicidal irradiation Methods 0.000 description 1
- 125000002948 undecyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 235000015112 vegetable and seed oil Nutrition 0.000 description 1
- 239000008158 vegetable oil Substances 0.000 description 1
- 210000003501 vero cell Anatomy 0.000 description 1
- 230000007332 vesicle formation Effects 0.000 description 1
- 230000006648 viral gene expression Effects 0.000 description 1
- 230000010464 virion assembly Effects 0.000 description 1
- 230000008673 vomiting Effects 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 239000012130 whole-cell lysate Substances 0.000 description 1
- 229940075420 xanthine Drugs 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
- C12N15/1131—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing against viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/14—Type of nucleic acid interfering N.A.
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/20011—Coronaviridae
- C12N2770/20022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
Definitions
- BACKGROUND COVID-19 is caused by the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), a positive-sense single-stranded (+ss) RNA virus.
- SARS-CoV-2 novel severe acute respiratory syndrome coronavirus 2
- the viral genome encodes 29 proteins, which include the four structural proteins, membrane or matrix (M), nucleocapsid (N), envelope (E), and spike (S) proteins.
- M membrane or matrix
- N nucleocapsid
- E envelope
- S spike
- NSP1-16 non-structural proteins
- ORF3a-ORF10 accessory proteins
- NSP1 binds to the mRNA entry site on the host ribosomal RNA to inhibit host translation.
- SARS-CoV-2 nucleocapsid protein interactome comprises many host RNA processing machinery proteins and stress granule proteins, suggesting a potential role in interfering with host RNA processing and driving stress granule formation.
- SARS-CoV-2 proteins bind to ⁇ 140 host transcripts. Focusing on non- coding RNAs, SARSCoV-2 NSP16 was found to bind to U1/U2 snRNA to interfere with splicing, while NSP8 and NSP9 bind to the signal recognition ribonucleoprotein 7SK to block protein trafficking.
- compositions comprising an inhibitory nucleic acid that is complementary to a portion of a target RNA.
- the target RNA comprises a portion of the SARS-CoV-2 genome.
- the target RNA comprises a portion of human RNA.
- the target RNA comprises human mRNA.
- the target RNA is associated with a portion of the 5’ untranslated region (UTR) of the target RNA. In some embodiments, the target RNA is associated with a portion of the coding region of the target RNA. In some embodiments, the target RNA is associated with a portion of the 3’ untranslated region of the target RNA.
- a SARS-CoV-2 protein selected from the group consisting of NSP2, NSP3, NSP5, NSP6, NSP7, NSP9, NSP12, NSP14, NSP15, ORF3b, ORF7b and ORF9c is capable of binding to a portion of the target RNA.
- the target RNA is associated with a gene selected from the group consisting of RPN1, UGGT1, OST4, NDUFA4, HSPA5, PSMD13, LAPTM4A, LAMP1, C1QBP, and LDHB.
- the inhibitory nucleic acid comprises one or more small interfering RNAs (siRNA).
- the one or more siRNAs comprise at least one sequence selected from the group consisting of SEQ ID NOs: 1-40.
- the inhibitory nucleic acid comprises one or more antisense oligonucleotides.
- the inhibitory nucleic acid comprises one or more locked nucleic acids (LNA).
- the one or more LNAs comprises a sequence selected from the group consisting of SEQ ID NOs: 52 and 53.
- the inhibitory nucleic acid comprises one or more phosphorothioate substitutions.
- the inhibitory nucleic acid comprises one or more 2’-O-methyl modifications.
- the inhibitory nucleic acid comprises one or more 2’-O-methoxyethyl modifications.
- the inhibitory nucleic acid comprises one or more 2 - fluoro modifications.
- the one or more siRNAs, antisense oligonucleotides, or locked nucleic acids are a plurality, wherein each siRNA, antisense oligonucleotide, or locked nucleic acid comprises the same sequence as the other siRNAs, antisense oligonucleotides, or locked nucleic acids in the plurality.
- the one or more siRNAs, antisense oligonucleotides, or locked nucleic acids contain a mixture of siRNAs, antisense oligonucleotides, or locked nucleic acids comprising at least two different sequences.
- the LNA is enclosed within a vector.
- the vector is a lentivirus. In some embodiments, the vector is an adenovirus. In some embodiments, the vector is an adeno-associated viral vector (AAV). In some embodiments, the vector is a liposome. Also provided herein are methods of treating Coronavirus disease 2019 (COVID-19) in a subject, the method comprising: administering to the subject a therapeutically effective amount of any one of the pharmaceutical compositions described herein, thereby treating Coronavirus disease 2019 (COVID-19) in the subject. In some embodiments, the therapeutically effective amount of the pharmaceutical composition decreases target RNA protein expression.
- inhibitory nucleic acids which is or comprises SEQ ID NOs: 1-40, wherein the inhibitory nucleic acid comprises a siRNA.
- inhibitory nucleic acids which is or comprises SEQ ID NOs: 52 or 53, wherein the inhibitory nucleic acid comprises a LNA.
- 1A is an exemplary schematic showing eCLIP performed on SARS-CoV-2 proteins in virus infected Vero E6 cells.
- Proteins in infected cells were UV crosslinked to bound transcripts, which were immunoprecipitated (IP) with antibodies that recognize NSP8 (primase), NSP12 (RNA dependent RNA polymerase, RdRp) and N (nucleocapsid) proteins.
- IP immunoprecipitated
- NSP8 primaryase
- NSP12 RNA dependent RNA polymerase
- N nucleocapsid
- FIG. 1C shows NSP12 eCLIP zoomed into yellow highlighted regions in FIG.1B. Top row, NSP12 eCLIP; bottom row, SHAPE Shannon entropy with a sliding median of 55 nt. Shaded region in bottom row is partitioned at the global median entropy.
- FIG. 1D shows correlation between normalized SHAPE entropy and normalized log2 (Fold Change) of IP over INPUT eCLIP read density i.e.
- FIG.1E shows secondary structure of the NSP12 eCLIP peak region from position 7412-7545.
- FIG. 1F shows results from filter-binding assay showing RNA crosslinked to NSP12 on the nitrocellulose layer, and the free RNA on the nylon layer.
- FIG.1G shows fraction of RNA bound to NSP12 from filter binding assay, using hairpin RNA from position 7414-7555 and scrambled RNA as negative control.
- FIG.1I shows a locked nucleic acid (LNA) designed to be complementary to the hairpin RNA from position 7414-7555.
- FIG.1J shows complementary LNA targeting SARS-CoV-2 genome shown to hybridize and disrupt the structure of the central hairpin as predicted by NUPACK software.
- FIG.1K shows immunofluorescence microscopy images showing fraction of infected cells as stained by SARS-CoV-2 nucleocapsid proteins (green), with cell nucleus stained with DAPI (green).
- FIG.1I shows a locked nucleic acid (LNA) designed to be complementary to the hairpin RNA from position 7414-7555.
- FIG.1J shows complementary LNA targeting SARS-CoV-2 genome shown to hybridize and disrupt the structure of the central hairpin as predicted by NUPACK software.
- FIG.1K shows immunofluorescence microscopy images showing fraction of infected cells as stained by SARS-CoV-2 nucleocapsid proteins (green), with cell nucleus stained with DA
- FIG. 1L shows a bar plot showing mean relative fluorescence intensities of cells from FIG. 1K, dots represent segmented individual cells (mean ⁇ s.e.m. * p ⁇ 0.01, two-tailed t-test).
- FIG. 1L shows a bar plot showing mean relative fluorescence intensities of cells from FIG. 1K, dots represent segmented individual cells (mean ⁇ s.e.m. * p ⁇ 0.01, two-tailed t-test).
- FIG.2A shows a bar plot showing number of all genes, number of all peaks, number of coding genes and number of peaks mapping to coding genes from
- FIG. 2B shows a stacked bar plot showing TPM of reads mapped to the Vero E6 genome or SARS-CoV-2 genome in each of NSP12, NSP8 and N eCLIP.
- FIG. 2C shows a Venn diagram showing number of African Green Monkey (host) genes targeted by NSP8 and NSP12.
- FIG.2D shows a Violin plot showing the distribution of Log2FoldChange in transcript levels in Vero E6 cells infected by SARS-CoV-2, for significantly differentially expressed genes (adjusted P ⁇ 0.05). Kolmogorov–Smirnov test p-values between eCLIP targets of NSP12 and NSP8 versus all differentially expressed genes are indicated above the plot.
- FIG.2E shows Top 25 Enriched Gene Ontology (GO) processes (adjusted p ⁇ 0.01) for NSP12 target host genes. Box plot indicates quartiles of differential expression (log2(Fold Change)) of target genes (grey dots).
- FIG. 3A shows an exemplary schematic showing SARS-CoV-2 proteins individually tagged and expressed in human lung epithelial cells BEAS-2B to assay with eCLIP.
- FIG. 3C shows a clustermap showing unique host coding genes (columns) targeted by each SARS-CoV-2 protein (rows).
- FIG. 3D shows example genome browser tracks for NSP3, NSP12, N and NSP2 mapping to DYNCH1, TUSC3, CXCL5 and NAP1L4 respectively.
- FIG.3E-3F show Western blots showing viral (pink background) and human (blue background) proteins enriched via CLASP (FIG.3E) and RIC (FIG.3F), with total cell lysate showed in input column (IN).
- FIG.3G shows Enriched Gene Ontology (GO) processes (adjusted p-value ⁇ 10-5) of unique eCLIP target coding genes for various SARS-CoV-2 proteins.
- FIG. 3H shows example sequence logos generated from all IDR peak reads for each SARS- CoV-2 eCLIP, with p-value indicated above each logo.
- FIG.4A shows a stacked bar plot showing fraction of reproducible peaks (by IDR14) mapping to different regions of coding genes.3ss, 3’ splice site; 3utr, 3’ untranslated region (UTR), 5ss, 5’ splice site; 5utr, 5’ UTR; CDS, coding sequence.
- FIG.4B shows a clustermap showing read density of target RNA by each SARS-CoV-2 protein scaled to a metagene profile containing 5’ UTR, CDS and 3’ UTR regions.
- FIG.4C shows an exemplary schematic showing the Renilla-MS2 and Firefly dual luciferase reporter constructs, where individual SARS-CoV-2 proteins fused to MCP are recruited to the Renillia-MS2 mRNA.
- FIGs.4D-4E show bar plots showing luciferase reporter activity ratios (FIG.4D) and reporter RT-qPCR ratios (FIG.4E) for the indicated coexpressed SARS-CoV-2 protein, known human regulators of RNA stability (CNOT7, BOLL) and negative control (FLAG peptide).
- FIG.4G shows a cumulative distributive plot (CDF) of log2(Fold Change) of gene expression in HEK293T cells transfected with a plasmid overexpressing NSP12 versus an empty vector plasmid.
- KS test p values indicate significance of difference in differential expression of NSP12 target genes versus non-eCLIP target genes.
- FIG.4H shows Enriched Gene Ontology (GO) processes (adjusted p ⁇ 10-4) of NSP12 target genes, with box plots indicating quartiles of differential expression (log2(Fold Change)) of target genes (black dots).
- FIG. 4I shows a CDF plot of ⁇ log2(Fold Change) of polysomal mRNA levels in BEAS-2B cells nucleofected with a plasmid overexpressing ORF9c versus an empty vector plasmid.
- KS test p values indicate significance of difference in differential expression of ORF9c target genes versus non-eCLIP target genes.
- FIG.4J shows Enriched BioPlanet pathways (adjusted p ⁇ 0.01) of ORF9c target genes, with box plots indicating quartiles of differential expression ( ⁇ log2(Fold Change) of polysomal mRNA levels) of target genes (black dots).
- FIG. 4K shows immunofluorescence images (40X) of SARS-CoV-2 infected A549-ACE2 cells stained for SARS-CoV-2 NSP8 (red), endogenous genes (green), DNA content (blue).
- FIG. 4L shows a heat map showing infection rate as measured by the integrated intensity of immunofluorescence staining of SARS-CoV-2 nucleocapsid protein in human iPSC derived lung organoid cells. Cells were treated with siRNAs targeting different host genes prior to viral infection by three different variants of SARS-CoV-2.
- FIG.5A shows a pie charts showing distribution of eCLIP peaks across different coding RNA regions for NSP2, NSP5, NSP7 and NSP9. Genomic content and exonic content are based on the hg19 human reference genome.
- FIG. 5B shows Jaccard index similarity of NSP9 target genes as compared with all 223 ENCODE RBP datasets.
- FIG.5C shows metadensity of eCLIP reads truncation sites averaged across all RNA targets by SARS-CoV-2 NSP2, NSP5, NSP7 and NSP9, and U2AF1/2 from the ENCODE consortium, zoomed into the region 150 nt upstream of 3’ splice sites, and the region 150 nt downstream of the 5’ end of the last exon.
- FIG.5D shows an exemplary schematic illustrating a model of NSP9 interacting with nuclear pore complex proteins NUP62, NUP214, NUP58, NUP88 and NUP54, and inhibiting U2AF2 substrate recognition in preventing NXF1 facilitated transport.
- FIG. 5C shows metadensity of eCLIP reads truncation sites averaged across all RNA targets by SARS-CoV-2 NSP2, NSP5, NSP7 and NSP9, and U2AF1/2 from the ENCODE consortium, zoomed into the region 150 nt upstream of 3’ splice sites
- FIG. 5E shows cumulative distributive plot (CDF) of log2(Fold Change) of BEAS-2B cells overexpressing NSP9 versus wildtype BEAS-2B cells in each of nuclear, cytosolic, and total mRNA fractions. Solid line indicate NSP9 target genes, dashed lines indicate genes that are not NSP9 targets.
- FIG. 5F shows genome browser tracks of NSP9 eCLIP target RNA mapped to IL-1 ⁇ , IL-1 ⁇ , ANXA2 and UPP1.
- FIG. 6A shows an exemplary schematic illustrating the complex host-viral relationship.
- Flat- ended arrows indicate downregulation
- pointed arrows indicate upregulation. Blue arrows are newly proposed interactions.
- FIG.6B shows an exemplary schematic illustrating the hypothesis that siRNA knockdown of host genes upregulated by SARS-CoV-2 proteins would result in reduced virus proliferation if the genes are pro-viral.
- FIG. 6C shows an exemplary schematic outlining the workflow of testing the antiviral effect of siRNAs targeting candidate pro-viral genes.
- siRNAs are transfected at 25 nM. Cells are infected at MOI of 3 for A549-ACE2 cells and MOI of 1 for lung organoid cells.
- 6D-6E show bar plots showing the infection rate (mean fluorescence intensity of nucleocapsid divided by cell count) by normalized to the scrambled siRNA control in A549- ACE2 cells (FIG.6D) and lung organoid cells (FIG.6E).
- FIG. 7A shows secondary structures that overlap with the major peaks enriched in NSP12 eCLIP at positions 3533-3635, 17202-17222, 21177-21206, and 24018-24079.
- FIG.7B shows results from a filter-binding assay showing RNA crosslinked to NSP12 on the nitrocellulose layer, and the free RNA on the nylon layer.
- FIG. 7C shows multiple sequence alignment of the region corresponding among the SARS- CoV-2 reference sequence, and homologous bat and pangolin sequences. Consensus sequence and structure are indicated underneath the aligned sequences.
- FIG.7D shows a phylogenetic tree constructed from FIG.7C.
- a target RNA that include delivering an exogenous nucleic acid into a cell, wherein the exogenous nucleic acid comprises an antisense oligonucleotide or a small interfering RNA (siRNA), and wherein the exogenous nucleic acid binds to the target RNA, thereby modulating gene expression of the target RNA in the cell.
- siRNA small interfering RNA
- a “cell” can refer to either a prokaryotic or eukaryotic cell, optionally obtained from a subject or a commercially available source.
- delivering can refer to the introduction of an exogenous polynucleotide into a host cell, irrespective of the method used for the introduction.
- Such methods include a variety of well-known techniques such as vector-mediated gene transfer (e.g., viral infection/transfection, or various other protein-based or lipid-based gene delivery complexes) as well as techniques facilitating the delivery of “naked” polynucleotides (e.g., electroporation, “gene gun” delivery and various other techniques used for the introduction of polynucleotides).
- the introduced polynucleotide may be stably or transiently maintained in the host cell.
- Stable maintenance typically requires that the introduced polynucleotide either contains an origin of replication compatible with the host cell or integrates into a replicon of the host cell such as an extrachromosomal replicon (e.g., a plasmid) or a nuclear or mitochondrial chromosome.
- a polynucleotide can be inserted into a host cell by a gene delivery molecule.
- gene delivery molecules can include, but are not limited to, liposomes, micelles biocompatible polymers, including natural polymers and synthetic polymers; lipoproteins; polypeptides; polysaccharides; lipopolysaccharides; artificial viral envelopes; metal particles; and bacteria, or viruses, such as baculovirus, adenovirus and retrovirus, bacteriophage, cosmid, plasmid, fungal vectors and other recombination vehicles typically used in the art which have been described for expression in a variety of eukaryotic and prokaryotic hosts, and may be used for gene therapy as well as for simple protein expression.
- the term “encode” as it is applied to nucleic acid sequences refers to a polynucleotide which is said to “encode” a polypeptide if, in its native state or when manipulated by methods well known to those skilled in the art, can be transcribed and/or translated to produce the mRNA for the polypeptide and/or a fragment thereof.
- the antisense strand is the complement of such a nucleic acid, and the encoding sequence can be deduced therefrom.
- the term “exogenous” refers to any material introduced from or originating from outside a cell, a tissue or an organism that is not produced by or does not originate from the same cell, tissue, or organism in which it is being introduced.
- expression refers to the process by which polynucleotides are transcribed into mRNA and/or the process by which the transcribed mRNA is subsequently translated into peptides, polypeptides, or proteins.
- expression may include splicing of the mRNA in a eukaryotic cell.
- the expression level of a gene may be determined by measuring the amount of mRNA or protein in a cell or tissue sample; further, the expression level of multiple genes can be determined to establish an expression profile for a particular sample.
- nucleic acid is used to include any compound and/or substance that comprise a polymer of nucleotides.
- a polymer of nucleotides are referred to as polynucleotides.
- Exemplary nucleic acids or polynucleotides can include, but are not limited to, ribonucleic acids (RNAs), deoxyribonucleic acids (DNAs), threose nucleic acids (TNAs), glycol nucleic acids (GNAs), peptide nucleic acids (PNAs), locked nucleic acids (LNAs, including LNA having a ⁇ -D-ribo configuration, ⁇ -LNA having an ⁇ -L-ribo configuration (a diastereomer of LNA), 2’-amino-LNA having a 2’-amino functionalization, and 2’-amino- ⁇ -LNA having a 2’-amino functionalization) or hybrids thereof.
- RNAs ribonucleic acids
- Naturally- occurring nucleic acids generally have a deoxyribose sugar (e.g., found in deoxyribonucleic acid (DNA)) or a ribose sugar (e.g., found in ribonucleic acid (RNA)).
- a nucleic acid can contain nucleotides having any of a variety of analogs of these sugar moieties that are known in the art.
- a deoxyribonucleic acid can have one or more bases selected from the group consisting of adenine (A), thymine (T), cytosine (C), or guanine (G), and a ribonucleic acid (RNA) can have one or more bases selected from the group consisting of uracil (U), adenine (A), cytosine (C), or guanine (G).
- the term “nucleic acid” refers to a deoxyribonucleic acid (DNA) or ribonucleic acid (RNA), or a combination thereof, in either a single- or double-stranded form.
- the term encompasses nucleic acids containing known analogues of natural nucleotides that have similar binding properties as the reference nucleotides. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses complementary sequences as well as the sequence explicitly indicated.
- the isolated nucleic acid is DNA. In some embodiments of any of the isolated nucleic acids described herein, the isolated nucleic acid is RNA. Modifications can be introduced into a nucleotide sequence by standard techniques known in the art, such as site-directed mutagenesis and polymerase chain reaction (PCR)-mediated mutagenesis.
- PCR polymerase chain reaction
- Conservative amino acid substitutions are ones in which the amino acid residue is replaced with an amino acid residue having a similar side chain.
- Families of amino acid residues having similar side chains have been defined in the art. These families include amino acids with basic side chains (e.g., arginine, lysine and histidine), acidic side chains (e.g., aspartic acid and glutamic acid), uncharged polar side chains (e.g., asparagine, cysteine, glutamine, glycine, serine, threonine, tyrosine, and tryptophan), nonpolar side chains (e.g., alanine, isoleucine, leucine, methionine, phenylalanine, proline, and valine), beta-branched side chains (e.g., isoleucine, threonine, and valine), and aromatic side chains (e.g., histidine, phenylalanine, tryptophan, and tyrosine), and aromatic
- nucleotide sequence encoding a protein includes all nucleotide sequences that are degenerate versions of each other and thus encode the same amino acid sequence.
- the term “plurality” can refer to a state of having a plural (e.g., more than one) number of different types of things (e.g., a cell, a genomic sequence, a subject, a system, or a protein).
- a plurality of genomic sequences can be more than one genomic sequence wherein each genomic sequence is different from each other.
- subject is intended to include any mammal.
- the subject is cat, a dog, a goat, a human, a non-human primate, a rodent (e.g., a mouse or a rat), a pig, or a sheep.
- the term “transduced”, “transfected”, or “transformed” refers to a process by which exogenous nucleic acid is introduced or transferred into a cell.
- a “transduced,” “transfected,” or “transformed” mammalian cell is one that has been transduced, transfected or transformed with exogenous nucleic acid (e.g., a gene delivery vector) that includes an exogenous nucleic acid encoding RNA-binding zinc finger domain).
- the term “treating” means a reduction in the number, frequency, severity, or duration of one or more (e.g., two, three, four, five, or six) symptoms of a disease or disorder in a subject (e.g., any of the subjects described herein), and/or results in a decrease in the development and/or worsening of one or more symptoms of a disease or disorder in a subject.
- Inhibitory Nucleic Acids Disclosed herein are inhibitory nucleic acids which block the interaction between a host and SARS-CoV2-associated RNA or protein.
- the inhibitory nucleic acids are oligonucleotides.
- the inhibitory nucleic acids are antisense oligonucleotides.
- the inhibitory nucleic acids are modified oligonucleotides.
- the inhibitory nucleic acids are locked nucleic acids (LNAs).
- the modified oligonucleotides are antisense oligonucleotides that hybridize to viral RNA.
- the inhibitory nucleic acids are designed to target a specific region of viral RNA. For example, a specific functional region can be targeted.
- the inhibitory nucleic acids hybridize to a viral genome.
- the inhibitory nucleic acids are complementary to and/or hybridize to SARS-CoV2.
- the inhibitory nucleic acids can be used in model systems, e.g., lung organoids.
- the inhibitory nucleic acids are complementary to and/or hybridize and/or bind to host RNA.
- the inhibitory nucleic acids hybridize to the polyA tail of host mRNA.
- the inhibitory nucleic acids can be used to treat viral infections in a host.
- the inhibitory nucleic acids can be used in pharmaceutical compositions to achieve a therapeutic effect, e.g., reducing or alleviating the symptoms of a disease in a patient.
- the disease is a viral infection.
- the therapeutic effect includes a reduction or elimination of symptoms including fever or chills, cough, shortness of breath or difficulty breathing, fatigue, muscle or body aches, headache, new loss of taste or smell, sore throat, congesting or runny nose, nausea or vomiting, diarrhea.
- the pharmaceutical composition comprises an inhibitory nucleic acid, e.g., an antisense oligonucleotide that is complementary to one or more viral RNAs, which may cause the disease in the patient.
- the inhibitory nucleic acid may be complementary to one or more host RNAs.
- the antisense oligonucleotide includes different modifications, e.g., in the sugar backbone to make it more cell permeable and nuclease resistant and physiologically non-toxic at low concentrations.
- Inhibitory nucleic acids useful in the present methods and compositions include antisense oligonucleotides, modified bases/locked nucleic acids (LNAs), antagomirs, peptide nucleic acids (PNAs), double stranded RNA species, siRNAs, morpholinos, and other oligomeric compounds or oligonucleotide mimetics which hybridize to at least a portion of the target nucleic acid (i.e., an viral RNA gene, e.g., a SARS-CoV2 gene) and modulate its abundance, splicing, post-transcriptional processing, or translation; see, e.g., U.S. Patent Nos. 9,045,749 and 9,476,046.
- LNAs locked nucleic acids
- PNAs peptide nucleic acids
- siRNAs e.g., a SARS-CoV2 gene
- the inhibitory nucleic acids include antisense RNA, antisense DNA, chimeric antisense oligonucleotides, antisense oligonucleotides comprising modified linkages, or combinations thereof. See, e.g., WO 2010040112.
- the inhibitory nucleic acids are LNAs.
- a locked nucleic acid is a nucleotide having a modified ribose moiety in which the ribose moiety comprises an extra bridge connecting the 2′ and 4′ carbons. This structure effectively “locks” the ribose in the 3′- endo structural conformation.
- LNA molecules can include molecules comprising 10-30, e.g., 12-24, e.g., 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides in each strand, wherein one of the strands is substantially identical, e.g., at least 80% (or more, e.g., 85%, 90%, 95%, or 100%) identical, e.g., having 3, 2, 1, or 0 mismatched nucleotide(s), to a SARS-CoV2 sequence.
- the LNA molecules can be chemically synthesized using methods known in the art.
- the LNA molecules can be designed using any method known in the art; a number of algorithms are known, and are commercially available (e.g., on the internet, for example at exiqon.com). See, e.g., You et al., Nuc. Acids. Res. 34:e60 (2006); McTigue et al., Biochemistry 43:5388-405 (2004); and Levin et al., Nuc. Acids. Res. 34:e142 (2006).
- “gene walk” methods similar to those used to design antisense oligos, can be used to optimize the inhibitory activity of the LNA (or any other inhibitory nucleic acid described herein); for example, a series of oligonucleotides of 10-30 nucleotides spanning the length of a target SARS-CoV2 sequence can be prepared, followed by testing for activity.
- gaps e.g., of 5-10 nucleotides or more, can be left between the LNAs to reduce the number of oligonucleotides synthesized and tested.
- GC content is preferably between about 30 and 60%.
- LNA sequences will bind very tightly to other LNA sequences, so it is preferable to avoid significant complementarity within an LNA. Contiguous runs of three or more Gs or Cs, or more than four LNA residues, should be avoided where possible (for example, it may not be possible with very short (e.g., about 9-10 nt) oligonucleotides).
- the LNAs are xylo- LNAs.
- the inhibitory nucleic acids are siRNA.
- siRNA refers to small inhibitory RNA duplexes that induce the RNA interference (RNAi) pathway.
- RNA small interfering RNA
- siRNA sometimes known as “short interfering RNA” or “silencing RNA”
- RNAi RNA interference
- RNA interference can offer a therapeutic approach for treating viral infections, wherein small interfering RNAs (siRNAs) can be used to target a viral sequence and inhibit (i.e., reduce or eliminate) viral gene expression, and also for treatment of a viral infection in a mammal, such as a human.
- the small interfering RNA constructs can inhibit gene expression of a virus by inducing cleavage of viral polynucleotide sequences within or near the target RNA sequence that is recognized by the antisense sequence of the small interfering RNA.
- siRNAs interfere with the expression of specific genes with complementary nucleotide sequences by degrading mRNA after transcription, preventing translation.
- RNA molecules can vary in length (generally 18-30 base pairs) and contain varying degrees of complementarity to their target mRNA in the antisense strand. Some, but not all, siRNA have unpaired overhanging bases on the 5′ or 3′ end of the sense strand and/or the antisense strand.
- the term “siRNA” includes duplexes of two separate strands, as well as single strands that can form hairpin structures comprising a duplex region.
- a small interfering RNA can refer to an RNA construct that contains one or more short sequences that are at least partially complementary to and can interact with a polynucleotide sequence of a virus (e.g., viral RNA).
- Interaction may be in the form of a direct binding between complementary (antisense) sequences of the small interfering RNA and polynucleotide sequences of the viral target, or in the form of an indirect interaction via enzymatic machinery (e.g., a protein complex) that allows the antisense sequence of the small interfering RNA to recognize the target sequence.
- enzymatic machinery e.g., a protein complex
- recognition of the target sequence by the small interfering RNA results in cleavage of viral sequences within or near the target site that is recognized by the recognition (antisense) sequence of the small interfering RNA.
- the small interfering RNA can exclusively contain ribonucleotide residues, or the small interfering RNA can contain one or more modified residues, particularly at the ends of the small interfering RNA or on the sense strand of the small interfering RNA.
- small interfering RNA encompasses shRNA and siRNA, both of which are understood and known to those in the art to refer to RNA constructs with particular characteristics and types of configurations.
- the inhibitory nucleic acids are antisense oligonucleotides. Antisense oligonucleotides are typically designed to inhibit expression of a DNA or RNA target by binding to the target and halting expression at the level of transcription, translation, or splicing.
- Antisense oligonucleotides of the present compositions and methods are complementary nucleic acid sequences designed to a target viral RNA to achieve a therapeutic effect in a patient.
- inhibitory nucleic acids and/or oligonucleotides are chosen that are sufficiently complementary to the target, i.e., that hybridize sufficiently well and with sufficient specificity, to give the desired therapeutic effect.
- the inhibitory nucleic acids are 9 to 50, 9 to 21, 13 to 50, or 13 to 30 nucleotides in length.
- oligonucleotides having antisense portions of 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 nucleotides in length, or any range therein.
- the oligonucleotides are 15 nucleotides in length.
- the antisense or oligonucleotide compounds of the invention are 12 or 13 to 30 nucleotides in length.
- inhibitory nucleic acids having antisense portions of 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29 or 30 nucleotides in length, or any range therein.
- the inhibitory nucleic acids are chimeric oligonucleotides that contain two or more chemically distinct regions, each made up of at least one nucleotide.
- oligonucleotides typically contain at least one region of modified nucleotides that confers one or more beneficial properties (such as, for example, increased nuclease resistance, increased uptake into cells, increased binding affinity for the target) and a region that is a substrate for enzymes capable of cleaving RNA:DNA or RNA:RNA hybrids.
- Chimeric inhibitory nucleic acids of the invention may be formed as composite structures of two or more oligonucleotides, modified oligonucleotides, oligonucleosides and/or oligonucleotide mimetics as described above. Such compounds have also been referred to in the art as hybrids or gapmers.
- the oligonucleotide is a gapmer (contain a central stretch (gap) of DNA monomers sufficiently long to induce RNase H cleavage, flanked by blocks of LNA modified nucleotides; see, e.g., Stanton et al., Nucleic Acid Ther. 2012. 22: 344-359; Nowotny et al., Cell, 121:1005–1016, 2005; Kurreck, European Journal of Biochemistry 270:1628-1644, 2003; Fluiter et al., Mol Biosyst.5(8):838-43, 2009).
- gap central stretch
- the oligonucleotide is a mixmer (includes alternating short stretches of LNA and DNA; Naguibneva et al., Biomed Pharmacother. 2006 Nov; 60(9):633-8; ⁇ rom et al., Gene. 2006 May 10; 372():137-41).
- Representative United States patents that teach the preparation of such hybrid structures comprise, but are not limited to, US patent nos. 5,013,830; 5,149,797; 5,220,007; 5,256,775; 5,366,878; 5,403,711; 5,491,133; 5,565,350; 5,623,065; 5,652,355; 5,652,356; and 5,700,922, each of which is herein incorporated by reference.
- the inhibitory nucleic acid comprises at least one nucleotide modified at the 2' position of the sugar, most preferably a 2'-O-alkyl, 2'-O-alkyl-O-alkyl or 2'- fluoro-modified nucleotide.
- RNA modifications include 2'- fluoro, 2'-amino and 2' O-methyl modifications on the ribose of pyrimidines, abasic residues or an inverted base at the 3' end of the RNA.
- Such modifications are routinely incorporated into oligonucleotides and these oligonucleotides have been shown to have a higher Tm (i.e., higher target binding affinity) than; 2'-deoxyoligonucleotides against a given target.
- Tm i.e., higher target binding affinity
- a number of nucleotide and nucleoside modifications have been shown to make the oligonucleotide into which they are incorporated more resistant to nuclease digestion than the native oligodeoxynucleotide; these modified oligos survive intact for a longer time than unmodified oligonucleotides.
- modified oligonucleotides include those comprising modified backbones, for example, phosphorothioates, phosphotriesters, methyl phosphonates, short chain alkyl or cycloalkyl intersugar linkages or short chain heteroatomic or heterocyclic intersugar linkages.
- oligonucleotides with phosphorothioate backbones and those with heteroatom backbones particularly CH 2 -NH-O-CH 2 , CH,-N(CH 3 )- O-CH2 (known as a methylene(methylimino) or MMI backbone], CH 2 --O--N (CH 3 )-CH 2 , CH 2 -N (CH 3 )-N (CH 3 )-CH 2 and O-N (CH 3 )- CH 2 -CH 2 backbones, wherein the native phosphodiester backbone is represented as O- P-- O- CH,); amide backbones (see De Mesmaeker et al. Ace. Chem. Res.
- PNA peptide nucleic acid
- Phosphorus-containing linkages include, but are not limited to, phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, methyl and other alkyl phosphonates comprising 3'alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates comprising 3'-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates having normal 3'-5' linkages, 2'-5' linked analogs of these, and those having inverted polarity wherein the adjacent pairs of nucleoside units are linked 3'-5' to 5'-3' or 2'-5' to 5'-2'; see US patent nos.3,687,808; 4,469,863; 4,476,301; 5,023,243; 5, 177
- Morpholino-based oligomeric compounds are described in Dwaine A. Braasch and David R. Corey, Biochemistry, 2002, 41(14), 4503-4510); Genesis, volume 30, issue 3, 2001; Heasman, J., Dev. Biol., 2002, 243, 209-214; Nasevicius et al., Nat. Genet., 2000, 26, 216-220; Lacerra et al., Proc. Natl. Acad. Sci., 2000, 97, 9591-9596; and U.S. Pat. No.5,034,506, issued Jul.23, 1991.
- Modified oligonucleotide backbones that do not include a phosphorus atom therein have backbones that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages.
- These comprise those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH 2 component parts; see US patent nos.
- One or more substituted sugar moieties can also be included, e.g., one of the following at the 2' position: OH, SH, SCH 3 , F, OCN, OCH 3 OCH 3 , OCH 3 O(CH 2 )n CH 3 , O(CH 2 )n NH 2 or O(CH 2 )nCH 3 where n is from 1 to about 10; Ci to C10 lower alkyl, alkoxyalkoxy, substituted lower alkyl, alkaryl or aralkyl; Cl; Br; CN; CF3 ; OCF 3 ; O-, S , or N-alkyl; O-, S-, or N-alkenyl; SOCH 3 ; SO2 CH 3 ; ONO 2 ; NO 2 ; N 3 ; NH 2 ; heterocycloalkyl; heterocycloalkaryl; aminoalkylamino; polyalkylamino; substituted silyl; an RNA cleaving group; a reporter group; an intercal
- a preferred modification includes 2'-methoxyethoxy [2'- O-CH 2 CH 2 OCH 3 , also known as 2'-O-(2-methoxyethyl)] (Martin et al, HeIv. Chim. Acta, 1995, 78, 486).
- Other preferred modifications include 2'-methoxy (2'-O-CH 3 ), 2'-propoxy (2'-OCH 2 CH 2 CH 3 ) and 2'-fluoro (2'-F). Similar modifications may also be made at other positions on the oligonucleotide, particularly the 3' position of the sugar on the 3' terminal nucleotide and the 5' position of 5' terminal nucleotide.
- Oligonucleotides may also have sugar mimetics such as cyclobutyls in place of the pentofuranosyl group.
- Inhibitory nucleic acids can also include, additionally or alternatively, nucleobase (often referred to in the art simply as “base”) modifications or substitutions.
- nucleobase often referred to in the art simply as “base”
- “unmodified” or “natural” nucleobases include adenine (A), guanine (G), thymine (T), cytosine (C) and uracil (U).
- Modified nucleobases include nucleobases found only infrequently or transiently in natural nucleic acids, e.g., hypoxanthine, 6-methyladenine, 5-Me pyrimidines, particularly 5-methylcytosine (also referred to as 5-methyl-2' deoxycytosine and often referred to in the art as 5-Me-C), 5-hydroxymethylcytosine (HMC), glycosyl HMC and gentobiosyl HMC, as well as synthetic nucleobases, e.g., 2-aminoadenine, 2- (methylamino)adenine, 2- (imidazolylalkyl)adenine, 2-(aminoalklyamino)adenine or other heterosubstituted alkyladenines, 2-thiouracil, 2-thiothymine, 5-bromouracil, 5- hydroxymethyluracil, 8- azaguanine, 7-deazaguanine, N6 (6-aminohexyl)a
- oligonucleotide it is not necessary for all positions in a given oligonucleotide to be uniformly modified, and in fact more than one of the aforementioned modifications may be incorporated in a single oligonucleotide or even at within a single nucleoside within an oligonucleotide.
- both a sugar and an internucleoside linkage, i.e., the backbone, of the nucleotide units are replaced with novel groups.
- the base units are maintained for hybridization with an appropriate nucleic acid target compound.
- PNA peptide nucleic acid
- PNA compounds the sugar- backbone of an oligonucleotide is replaced with an amide containing backbone, for example, an aminoethylglycine backbone.
- the nucleobases are retained and are bound directly or indirectly to aza nitrogen atoms of the amide portion of the backbone.
- Representative United States patents that teach the preparation of PNA compounds comprise, but are not limited to, US patent nos. 5,539,082; 5,714,331; and 5,719,262, each of which is herein incorporated by reference. Further teaching of PNA compounds can be found in Nielsen et al, Science, 1991, 254, 1497-1500.
- Inhibitory nucleic acids can also include one or more nucleobase (often referred to in the art simply as “base”) modifications or substitutions.
- base any nucleobase (often referred to in the art simply as “base”) modifications or substitutions.
- “unmodified” or “natural” nucleobases comprise the purine bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) and uracil (U).
- Modified nucleobases comprise other synthetic and natural nucleobases such as 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudo-uracil), 4-thiouracil, 8-halo, 8-amino, 8- thiol, 8- thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5- bromo, 5-trifluoromethyl and other 5-
- the inhibitory nucleic acids are chemically linked to one or more moieties or conjugates that enhance the activity, cellular distribution, or cellular uptake of the oligonucleotide.
- moieties comprise but are not limited to, lipid moieties such as a cholesterol moiety, cholic acid, a thioether, e.g., hexyl-S- tritylthiol, a thiocholesterol, an aliphatic chain, e.g., dodecandiol or undecyl residues, a phospholipid, e.g., di-hexadecyl-rac- glycerol or triethylammonium 1 ,2-di-O-hexadecyl- rac-glycero-3-H-phosphonate, a polyamine or a polyethylene glycol chain, a palmityl moiety, or an octadecylamine or hexy
- conjugate groups of the invention include intercalators, reporter molecules, polyamines, polyamides, polyethylene glycols, polyethers, groups that enhance the pharmacodynamic properties of oligomers, and groups that enhance the pharmacokinetic properties of oligomers.
- Typical conjugate groups include cholesterols, lipids, phospholipids, biotin, phenazine, folate, phenanthridine, anthraquinone, acridine, fluoresceins, rhodamines, coumarins, and dyes.
- Groups that enhance the pharmacodynamic properties include groups that improve uptake, enhance resistance to degradation, and/or strengthen sequence-specific hybridization with the target nucleic acid.
- Groups that enhance the pharmacokinetic properties include groups that improve uptake, distribution, metabolism or excretion of the compounds of the present invention.
- the inhibitory nucleic acids useful in the present compositions and methods are sufficiently complementary to all or part of a target viral RNA, i.e., hybridize sufficiently well and with sufficient specificity, to give the desired therapeutic effect. “Complementary” refers to the capacity for pairing, through hydrogen bonding, between two sequences comprising naturally or non-naturally occurring bases or analogs thereof.
- hybridization means hydrogen bonding, which may be Watson-Crick, Hoogsteen or reversed Hoogsteen hydrogen bonding, between complementary nucleoside or nucleotide bases.
- adenine and thymine are complementary nucleobases which pair through the formation of hydrogen bonds.
- Complementary refers to the capacity for precise pairing between two nucleotides.
- the inhibitory nucleic acids and the target viral RNA are complementary to each other when a sufficient number of corresponding positions in each molecule are occupied by nucleotides that can hydrogen bond with each other.
- “specifically hybridizable” and “complementary” are terms which are used to indicate a sufficient degree of complementarity or precise pairing such that stable and specific binding occurs between the inhibitory nucleic acid and the target viral RNA. For example, if a base at one position of an inhibitory nucleic acid is capable of hydrogen bonding with a base at the corresponding position of the target viral RNA, then the bases are considered to be complementary to each other at that position.
- a complementary nucleic acid sequence need not be 100% complementary to that of its target nucleic acid to be specifically hybridisable.
- a complementary nucleic acid sequence for purposes of the present compositions and methods is specifically hybridisable when binding of the sequence to the target viral RNA interferes with the abundance, splicing, post-transcriptional processing, or translation of the mRNA transcript, and there is a sufficient degree of complementarity to avoid non-specific binding of the sequence to non-target nucleic acid molecules under conditions in which specific binding is desired, e.g., under physiological conditions.
- the inhibitory nucleic acids useful in the compositions and methods described herein have at least 80% sequence complementarity to a target region within the target nucleic acid, e.g., 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% sequence complementarity to the target region.
- a target region within the target nucleic acid
- an antisense compound in which 18 of 20 nucleobases of the antisense oligonucleotide are complementary, and would therefore specifically hybridize, to a target region would represent 90 percent complementarity.
- Percent complementarity of an inhibitory nucleic acid with a region of a target nucleic acid can be determined routinely using basic local alignment search tools (BLAST programs) (Altschul et al., J. Mol. Biol., 1990, 215, 403-410; Zhang and Madden, Genome Res., 1997, 7, 649-656).
- BLAST programs Basic local alignment search tools
- the inhibitory nucleic acids must retain specificity for their target, i.e., must not directly bind to, or directly significantly affect expression levels of, transcripts other than the intended target viral RNA.
- the inhibitory nucleic acid is capable of reducing viral infection in a human cell (e.g., HEK293 cell, an iPSC cell, BEAS-2B cell, or A549-ACE2 cell).
- a human cell e.g., HEK293 cell, an iPSC cell, BEAS-2B cell, or A549-ACE2 cell.
- the viral infection is reduced by at least about 30%, by at least about 35%, by at least about 40%, by at least about 45%, by at least about 50%, by at least about 55%, by at least about 60%, by at least about 65%, by at least about 70%, by at least about 75%, by at least about 80%, by at least about 85%, by at least about 90%, by at least about 95%, or about 100% compared to a human cell that is not exposed to the inhibitory nucleotide.
- the pharmaceutical composition is an inhibitory nucleic acids comprising a sequence selected from SEQ ID NOs: 1-40 (Table 1).
- the inhibitory nucleic acid includes a nucleic acid complementary to a sequence selected from SEQ ID NOs: 1-40.
- the inhibitory nucleic acid includes a double stranded inhibitory nucleic acid comprising at least one sequence selected from SEQ ID NO: 1-40.
- an inhibitory nucleic acid includes a sequence complement to at least one sequence selected from SEQ ID NO: 1-40.
- the pharmaceutical composition is an inhibitory nucleotide comprising any of SEQ ID NOs: 1-40 (and/or its complement) wherein all or some of the nucleotides have been modified in one or more ways as described above, e.g., with a phosphorothioate backbone, one or more 2'-methoxyethoxy modifications, one or more 2'-methoxy modifications, one or more 2'-propoxy modifications, or one or more 2'-fluoro modifications.
- the pharmaceutical composition comprises any of SEQ ID NOs: 1-40, and/or its complement.
- the pharmaceutical composition comprising any of SEQ ID NOs: 1-40, and its complement, is therapeutically effective for treating a disease caused by SARS-CoV2.
- Pharmaceutical Compositions The methods described herein can include the administration of pharmaceutical compositions and formulations comprising inhibitory nucleic acid sequences designed to target SARS-CoV2 RNA or host RNA that interacts with SARS-CoV2 proteins.
- the compositions are formulated with a pharmaceutically acceptable carrier.
- compositions and formulations can be administered parenterally, topically, orally or by local administration, such as by aerosol or transdermally.
- the pharmaceutical compositions can be formulated in any way and can be administered in a variety of unit dosage forms depending upon the condition or disease and the degree of illness, the general medical condition of each patient, the resulting preferred method of administration and the like. Details on techniques for formulation and administration of pharmaceuticals are well described in the scientific and patent literature, see, e.g., Remington: The Science and Practice of Pharmacy, 21st ed., 2005.
- the inhibitory nucleic acids can be administered alone or as a component of a pharmaceutical formulation (composition).
- the compounds may be formulated for administration, in any convenient way for use in human or veterinary medicine.
- compositions may conveniently be presented in unit dosage form and may be prepared by any methods well known in the art of pharmacy.
- the amount of active ingredient (e.g., nucleic acid sequences of this invention) which can be combined with a carrier material to produce a single dosage form can vary depending upon the host being treated, the particular mode of administration.
- the amount of active ingredient which can be combined with a carrier material to produce a single dosage form will generally be that amount of the compound which produces a therapeutic effect, e.g., reduction in SARS-CoV2 infection levels and/or reduction in a symptom of COVID19.
- Pharmaceutical compositions described herein can be prepared according to any method known to the art for the manufacture of pharmaceuticals. Such compositions can contain, for example, preserving agents.
- compositions can be admixtured with nontoxic pharmaceutically acceptable excipients which are suitable for manufacture.
- Compositions may comprise one or more diluents, emulsifiers, preservatives, buffers, excipients, etc. and may be provided in such forms as liquids, powders, emulsions, lyophilized powders, controlled release formulations, on patches, in implants, etc.
- Wetting agents, emulsifiers, and lubricants such as sodium lauryl sulfate and magnesium stearate, as well as coloring agents, release agents, coating agents, sweetening, flavoring and perfuming agents, preservatives and antioxidants can also be present in the compositions.
- Aqueous suspensions can contain an active agent (e.g., nucleic acid sequences of the invention) in admixture with excipients suitable for the manufacture of aqueous suspensions, e.g., for aqueous intradermal injections.
- an active agent e.g., nucleic acid sequences of the invention
- Such excipients include a suspending agent, such as sodium carboxymethylcellulose, methylcellulose, hydroxypropylmethylcellulose, sodium alginate, polyvinylpyrrolidone, gum tragacanth and gum acacia, and dispersing or wetting agents such as a naturally occurring phosphatide (e.g., lecithin), a condensation product of an alkylene oxide with a fatty acid (e.g., polyoxyethylene stearate), a condensation product of ethylene oxide with a long chain aliphatic alcohol (e.g., heptadecaethylene oxycetanol), a condensation product of ethylene oxide with a partial ester derived from a fatty acid and a hexitol (e.g., polyoxyethylene sorbitol mono-oleate), or a condensation product of ethylene oxide with a partial ester derived from fatty acid and a hexitol anhydride (e.g., polyoxyethylene sorbitan mono
- the aqueous suspension can also contain one or more preservatives such as ethyl or n-propyl p-hydroxybenzoate, one or more coloring agents, one or more flavoring agents and one or more sweetening agents, such as sucrose, aspartame or saccharin.
- preservatives such as ethyl or n-propyl p-hydroxybenzoate
- coloring agents such as ethyl or n-propyl p-hydroxybenzoate
- flavoring agents such as sucrose, aspartame or saccharin.
- sweetening agents such as sucrose, aspartame or saccharin.
- Formulations can be adjusted for osmolarity.
- oil-based pharmaceuticals are used for administration of nucleic acid sequences as described herein.
- an injectable oil vehicle see Minto (1997) J. Pharmacol. Exp. Ther.281:93-102.
- Pharmaceutical compositions can also be in the form of oil-in-water emulsions
- Suitable emulsifying agents include naturally-occurring gums, such as gum acacia and gum tragacanth, naturally occurring phosphatides, such as soybean lecithin, esters or partial esters derived from fatty acids and hexitol anhydrides, such as sorbitan mono-oleate, and condensation products of these partial esters with ethylene oxide, such as polyoxyethylene sorbitan mono-oleate.
- the emulsion can also contain sweetening agents and flavoring agents, as in the formulation of syrups and elixirs. Such formulations can also contain a demulcent, a preservative, or a coloring agent.
- these injectable oil-in-water emulsions of the invention comprise a paraffin oil, a sorbitan monooleate, an ethoxylated sorbitan monooleate and/or an ethoxylated sorbitan trioleate.
- the pharmaceutical compositions can also be delivered as microspheres for slow release in the body.
- microspheres can be administered via intradermal injection of drug which slowly release subcutaneously; see Rao (1995) J. Biomater Sci. Polym. Ed. 7:623-645; as biodegradable and injectable gel formulations, see, e.g., Gao (1995) Pharm. Res.
- the pharmaceutical compositions can be parenterally administered, such as by intravenous (IV) administration or administration into a body cavity or lumen of an organ.
- IV intravenous
- These formulations can comprise a solution of active agent dissolved in a pharmaceutically acceptable carrier.
- Acceptable vehicles and solvents that can be employed are water and Ringer's solution, an isotonic sodium chloride.
- sterile fixed oils can be employed as a solvent or suspending medium. For this purpose any bland fixed oil can be employed including synthetic mono- or diglycerides.
- fatty acids such as oleic acid can likewise be used in the preparation of injectables. These solutions are sterile and generally free of undesirable matter.
- These formulations may be sterilized by conventional, well known sterilization techniques.
- the formulations may contain pharmaceutically acceptable auxiliary substances as required to approximate physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, sodium lactate and the like.
- concentration of active agent in these formulations can vary widely, and will be selected primarily based on fluid volumes, viscosities, body weight, and the like, in accordance with the particular mode of administration selected and the patient's needs.
- the formulation can be a sterile injectable preparation, such as a sterile injectable aqueous or oleaginous suspension.
- This suspension can be formulated using those suitable dispersing or wetting agents and suspending agents.
- the sterile injectable preparation can also be a suspension in a nontoxic parenterally- acceptable diluent or solvent, such as a solution of 1,3-butanediol.
- the administration can be by bolus or continuous infusion (e.g., substantially uninterrupted introduction into a blood vessel for a specified period of time).
- the pharmaceutical compounds and formulations can be lyophilized.
- Stable lyophilized formulations comprising an inhibitory nucleic acid can be made by lyophilizing a solution comprising a pharmaceutical of the invention and a bulking agent, e.g., mannitol, trehalose, raffinose, and sucrose or mixtures thereof.
- a process for preparing a stable lyophilized formulation can include lyophilizing a solution about 2.5 mg/mL protein, about 15 mg/mL sucrose, about 19 mg/mL NaCl, and a sodium citrate buffer having a pH greater than 5.5 but less than 6.5. See, e.g., U.S.20040028670.
- the compositions and formulations can be delivered by the use of liposomes.
- liposomes particularly where the liposome surface carries ligands specific for target cells, or are otherwise preferentially directed to a specific organ, one can focus the delivery of the active agent into target cells in vivo. See, e.g., U.S. Patent Nos.6,063,400; 6,007,839; Al-Muhammed (1996) J. Microencapsul.13:293-306; Chonn (1995) Curr. Opin. Biotechnol.6:698-708; Ostro (1989) Am. J. Hosp. Pharm. 46:1576-1587.
- liposome means a vesicle composed of amphiphilic lipids arranged in a bilayer or bilayers. Liposomes are unilamellar or multilamellar vesicles that have a membrane formed from a lipophilic material and an aqueous interior that contains the composition to be delivered. Cationic liposomes are positively charged liposomes that are believed to interact with negatively charged DNA molecules to form a stable complex. Liposomes that are pH-sensitive or negatively-charged are believed to entrap DNA rather than complex with it. Both cationic and noncationic liposomes have been used to deliver DNA to cells.
- Liposomes can also include “sterically stabilized” liposomes, i.e., liposomes comprising one or more specialized lipids. When incorporated into liposomes, these specialized lipids result in liposomes with enhanced circulation lifetimes relative to liposomes lacking such specialized lipids.
- sterically stabilized liposomes are those in which part of the vesicle-forming lipid portion of the liposome comprises one or more glycolipids or is derivatized with one or more hydrophilic polymers, such as a polyethylene glycol (PEG) moiety.
- PEG polyethylene glycol
- compositions disclosed herein can be administered for prophylactic and/or therapeutic treatments.
- compositions are administered to a subject who is infected or at risk of infection with SARS-CoV2, in an amount sufficient to cure, alleviate or partially arrest the clinical manifestations of the disorder or its complications; this can be called a therapeutically effective amount.
- pharmaceutical compositions of the invention are administered in an amount sufficient to decrease the number of lung cells infected with SARS-CoV2. The amount of pharmaceutical composition adequate to accomplish this is a therapeutically effective dose.
- the dosage schedule and amounts effective for this use i.e., the dosing regimen, will depend upon a variety of factors, including the stage of the disease or condition, the severity of the disease or condition, the general state of the patient's health, the patient’s physical status, age, and the like.
- the mode of administration also is taken into consideration.
- the dosage regimen also takes into consideration pharmacokinetics parameters well known in the art, i.e., the active agents’ rate of absorption, bioavailability, metabolism, clearance, and the like (see, e.g., Hidalgo-Aragones (1996) J. Steroid Biochem. Mol. Biol.
- compositions can be given depending on for example: the dosage and frequency as required and tolerated by the patient, the degree and amount of therapeutic effect generated after each administration (e.g., effect on symptoms), and the like.
- the compositions should provide a sufficient quantity of active agent to effectively treat, prevent or ameliorate conditions, diseases or symptoms.
- the inhibitory nucleic acids used to practice the methods described herein can be isolated from a variety of sources, genetically engineered, amplified, and/or expressed/ generated recombinantly. Recombinant nucleic acid sequences can be individually isolated or cloned and tested for a desired activity. Any recombinant expression system can be used, including e.g.
- Inhibitory nucleic acids of the invention can be inserted into delivery vectors and expressed from transcription units within the vectors.
- the recombinant vectors can be DNA plasmids or viral vectors.
- Generation of the vector construct can be accomplished using any suitable genetic engineering techniques well known in the art, including, without limitation, the standard techniques of PCR, oligonucleotide synthesis, restriction endonuclease digestion, ligation, transformation, plasmid purification, and DNA sequencing, for example as described in Sambrook et al. Molecular Cloning: A Laboratory Manual. (1989)), Coffin et al. (Retroviruses.
- RNA Viruses A Practical Approach
- suitable vectors are available for transferring nucleic acids of the invention into cells.
- Viral vectors comprise a nucleotide sequence having sequences for the production of recombinant virus in a packaging cell.
- Viral vectors expressing nucleic acids of the invention can be constructed based on viral backbones including, but not limited to, a retrovirus, lentivirus, adenovirus, adeno-associated virus, pox virus, or alphavirus.
- the recombinant vectors capable of expressing the nucleic acids of the invention can be delivered as described herein, and persist in target cells (e.g., stable transformants).
- target cells e.g., stable transformants.
- the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2, and/or any variant of SARS-CoV2) is a positive-sense single-stranded (+ss) RNA virus, wherein the viral genome encodes 29 proteins, which include the four structural proteins, membrane or matrix (M), nucleocapsid (N), envelope (E), and spike (S) proteins.
- SARS-CoV-2 positive-sense single-stranded
- M membrane or matrix
- N nucleocapsid
- E envelope
- S spike
- kits for treating Coronavirus disease 2019 (COVID-19) in a subject that include administering to the subject a therapeutically effective amount of a composition comprising an exogenous nucleic acid (e.g., an inhibitory nucleic acid) and delivering the exogenous nucleic acid into a cell, wherein the exogenous nucleic acid binds to a target RNA and modulates gene expression of the target RNA, thereby treating Coronavirus disease 2019 (COVID-19) in the subject.
- an exogenous nucleic acid e.g., an inhibitory nucleic acid
- exogenous nucleic acid refers to any DNA or RNA, or a combination thereof, in either a single- or double-stranded form that is introduced from or originating from outside a cell, a tissue or an organism that is not produced by or does not originate from the same cell, tissue, or organism in which it is being introduced.
- the exogenous nucleic acid is a single stranded nucleic acid.
- the exogenous nucleic acid is a double stranded nucleic acid.
- the exogenous nucleic acid is an inhibitory nucleic acid.
- the exogenous nucleic acid is one of the inhibitory nucleic acids described herein.
- the inhibitory nucleotides disclosed herein modulate gene expression within a cell.
- modulating can refer to modifying, regulating, or altering the endogenous gene expression in a cell.
- modulating gene expression can include systematically influencing RNA stability and/or translation by activating or suppressing the gene expression.
- modulation of gene expression can include stabilizing a target RNA.
- stabilizing a target RNA can increase translation of the target RNA.
- modulation of gene expression can include destabilizing a target RNA.
- destabilizing a target RNA can suppress translation of the target RNA.
- modulation of gene expression can include increasing translation of a target RNA.
- modulation of gene expression can include suppressing translation of a target RNA.
- the gene expression of the target RNA is upregulated.
- the gene expression of the target RNA is downregulated.
- the modulating gene expression comprises knockdown of a gene expressed from the target RNA.
- the target RNA interacts with a viral protein.
- the target RNA interacting with the viral protein increases protein expression from the gene expressed from the target RNA.
- the target RNA interacting with the viral protein inhibits protein expression from the gene expressed from the target RNA.
- the modulating comprises blocking the target RNA from interacting with the viral protein.
- the modulating comprises blocking the target RNA from interacting with the viral protein wherein the exogenous nucleic acid binds to the target RNA.
- the modulating comprises blocking the target RNA from interacting with the viral protein wherein the exogenous nucleic acid binds to the target RNA by targeting the structured genomic RNA that interacts with the viral protein.
- an endogenous gene is expressed from the target RNA.
- the gene expressed from the target RNA is selected from the group consisting of RPN1, UGGT1, OST4, NDUFA4, HSPA5, PSMD13, LAPTMFA, LAMP1, C1QBP, and LDHB.
- the viral protein is a SARS-CoV-2 protein (e.g., SARS-CoV2 and any variant thereof).
- the SARS-CoV-2 protein binds to the 5’ UTR, the coding sequence, or the 3’ UTR region of the target RNA.
- the SARS-CoV-2 protein binds to the 5’ UTR region of the target RNA.
- the SARS-CoV-2 protein binds to the coding sequence region of the target RNA. In some embodiments, the SARS-CoV-2 protein binds to the 3’ UTR region of the target RNA. In some embodiments, the delivering of an exogenous nucleic acid into a cell comprises transfection of the exogenous nucleic acid into the cell. In some embodiments, the delivering comprises a virus-based delivery of the exogenous nucleic acid into the cell. In some embodiments, the virus-based delivery comprises a lentivirus. Coronavirus-derived protein As used herein, the term “viral protein” refers to a protein generated by a virus. A viral protein can be both a component and a product of a virus.
- Viral proteins can be grouped according to their functions, wherein groups of the viral proteins can include structural proteins, nonstructural proteins, regulatory proteins, and accessory proteins.
- Coronaviruses contain a positive-sense, single-stranded RNA genome, and the viral genome consists of more than 29,000 bases and encodes 29 proteins.
- SARS-CoV-2 has four structural proteins: the E and M proteins, which form the viral envelope; the N protein, which binds to the virus’s RNA genome; and the S protein, which binds to human receptors.
- a “coronavirus-derived protein” or “SARS-CoV-2 protein” can refer to a protein that is encoded from the coronavirus viral genome.
- the SARS-CoV-2 protein can be a structural protein. In some embodiments, the SARS-CoV-2 protein can be a non- structural protein (NSP). In some embodiments, the SARS-CoV-2 protein can comprise a NSP1, a NSP2, a NSP3, a NSP4, a NSP5, a NSP6, a NSP7, a NSP8, a NSP9, a NSP10, a NSP11, a NSP12, a NSP13, a NSP14, a NSP15, or a NSP16 protein. In some embodiments, the SARS- CoV-2 protein can be an accessory protein.
- the SARS-CoV-2 protein can comprise a ORF3a, a ORF3b, a ORF6, a ORF7a, a ORF7b, a ORF8, a ORF9b, a ORF9c, or a ORF10 protein.
- the SARS-CoV-2 protein comprises a NSP1, a NSP2, a NSP3, a NSP4, a NSP5, a NSP6, a NSP7, a NSP8, a NSP9, a NSP10, a NSP11, a NSP12, a NSP13, a NSP14, a NSP15, a NSP16, a ORF3a, a ORF3b, a ORF6, a ORF7a, a ORF7b, a ORF8, a ORF9b, a ORF9c, or a ORF10 protein.
- the SARS- CoV-2 protein is selected from the group consisting of NSP2, NSP3, NSP6, NSP12, NSP14, ORF3b, ORF7b, and ORF9c.
- NSP2, NSP3, NSP6, NSP12, NSP14, ORF3b, ORF7b, and ORF9c interact with host RNA.
- inhibitory nucleic acids block the interaction between a coronavirus-derived protein and a host RNA.
- the inhibitory nucleic acid modulates gene expression of a host RNA.
- Example 1 - eCLIP elucidates SARS-CoV-2 protein-viral RNA interactions in virus infected cells
- SARSCoV-2 infected African Green Monkey kidney (Vero E6) cells was performed, which are an efficiently infected cell line (FIG. 1A).
- Infected cells were subject to UV irradiation, which covalently crosslinked interacting proteins to RNAs.
- UV irradiation which covalently crosslinked interacting proteins to RNAs.
- NSP8 and NSP12 which form part of the replication transcription complex (RTC), and N (nucleocapsid
- RNA-bound proteins were resolved via SDS-PAGE and transferred to nitrocellulose membranes such that only the region spanning the expected protein size and 75 kDa larger were excised and purified in subsequent steps.
- the same size region of a non- immunoprecipitated input whole cell lysate was included as size-matched input to identify and remove non-specific, enriched sequences.
- RNA was converted to DNA libraries, sequenced to an average depth of ⁇ 25 million reads, and mapped to the SARS-CoV-2 viral genome and African Green Monkey genome to determine SARS-CoV-2 protein RNA interactions.
- reads from the immunoprecipitation (IP) samples correspond to RNA crosslinked to the IP enriched proteins NSP8, NSP12, and N
- reads from the input (IN) samples correspond to RNA crosslinked to RBPs at a similar size to the IP protein in the cell milieu.
- Normalizing read density of IP to IN samples provide a measure of protein-specific RNA interaction.
- the eCLIP results provide a genome-wide map of RNA interactions with viral proteins during an authentic SARS-CoV-2 infection. Reads were mapped across the entire genome in the positive sense for both IP and IN samples for all proteins, with greater than 96% coverage in all input and IP samples. The near complete coverage implies that most of the viral RNA interacts with RNA binding proteins.
- NSP12 and NSP8 appear strongly enriched (>5-fold relative to global median) after the stop codon in N at position 29,533 up to the start of the S2M structured region in the 3’ UTR at position 29,695.
- NSP12 and NSP8 continued to be strongly enriched again after the S2M region at position 29,809.
- the lack of enrichment at the S2M structure may suggest that its function is unrelated to recruiting replicase proteins NSP12 and NSP8.
- the eCLIP findings thus provide a map of the direct interaction between replicase proteins NSP8 and NSP12 with regions in the UTRs likely involved in regulating replication and transcription. [Table 2] SARS-CoV-2 genome regions with ⁇ 5-fold ⁇ ReadDensity
- NSP12 and NSP8 IP samples enriched the fraction of negative sense reads to 0.067 and 0.039, about 100- and 10-fold from IN samples, respectively.
- the negative sense strand coverage in IP samples was also lower in N (33%) than in NSP12 (80%) or NSP8 (58%).
- NSP12 and NSP8 IP reads are piled up in the 5’ and 3’ regions on the negative sense strand, which is similar to the positive sense strand.
- this peak overlapped with a structured region of 3 consecutive stem loops at region 7412-7545 (FIG.1E).
- a filter binding assay of NSP12 was performed with in vitro transcribed RNA bearing the sequence in this region (FIG. 1F).
- a scrambled control was included with the same sequence composition but shuffled such that the structure is no longer preserved (Table 3).
- the peak RNA showed a binding dissociation constant KD of 17 nM, compared to a K D of 79 nM by the scrambled control (FIG.1G).
- the scrambled control likely represents the non-specific affinity of NSP12 for RNA (FIG.7B).
- FIG.7B Nucleic acid sequences used in filter binding assay
- N and NSP12 show no correlation (R 2 of 0.01), whereas N and NSP8 are slightly correlated (R2 of 0.27; FIG.1H).
- R2 of 0.27 As expected, NSP8 and NSP12 were the most highly correlated (R2 of 0.57), though there were still substantial differences, as 43% of the variation was unaccounted for by the other protein (FIG. 1H).
- R2 of 0.57 As expected, when the data was transformed logarithmically, greater correlations among all three proteins were observed. As logarithmic transformations shed light on signals at lower values, the greater correlations among the log-transformed data of different proteins may imply a greater similarity of more transient protein-RNA interactions.
- LNA locked nucleic acid
- LNA7472– 7493 (SEQ ID NO: 52 - +T+A+C+A+A+GT+T+G+AT+GA+A+T+TA+C+A+A+C+C) and scrambled LNA (SEQ ID NO: 53 - +G+C+GGC+ACG+TTG+CG+AGT+A+C+T) were synthesized by Integrated DNA Technologies.
- the LNA was predicted to hybridize with SARS-CoV-2 RNA from position 7414-7555 using the NUPACK software1 (FIG. 1J).
- Vero E6 cells transfected with LNA7472–7493 and a scrambled LNA were infected with SARS- CoV-2 (WA1 isolate) and the infection rate was assessed by immunofluorescence 24 hours post infection. It was found that treating cells with LNA7472–7493 reduced the infection rate (as given by integrated intensity) by 64% (p ⁇ 0.01) (FIGs. 1K-1L). This shows that targeting structured genomic RNA interacting with SARS-CoV-2 protein NSP12 may be a potential antiviral treatment strategy.
- modalities may not be limited to LNAs, but also to include siRNAs and small molecules.
- RNA virus infections have been shown to perturb host transcriptomes, such as via mRNA degradation, mRNA export inhibition, splicing interference, 5’ cap stealing, and other ways of host translation inhibition
- evidence for direct interaction between viral proteins host cell RNAs can shed light on the mechanism and function as a result of these interactions. Therefore, the extent to which NSP8, NSP12 and N interacted with host RNAs was investigated.
- Targeted transcripts were determined by having one or more peaks that meet the stringent IDR (irreproducible discovery rate) threshold of overlapping peaks between two replicates for every protein, and satisfy statistical cutoffs of p ⁇ 0.001, and more than 8-fold enrichment in the immunoprecipitated sample (IP) over the size-matched input sample.
- IDR immunoprecipitated discovery rate
- IP immunoprecipitated sample
- NSP12 and NSP8 shared 128 genes in common (18% of NSP12 targets, 26% of NSP8 targets), implying that NSP12 and NSP8 may interact with different host genes in both their individual and complexed states (FIG. 2C).
- FOG. 2C a majority of N immunoprecipitated RNA reads were mapped to viral RNA, consistent with its role in enclosing the viral genome during virion assembly.
- the large number of peaks (2137 total) that map to the 1058 host genes further suggests a potential in perturbing host gene expression that may be required for viral replication.
- transcriptome-wide mRNA sequencing of SARS-CoV-2 infected Vero E6 cells was performed and the eCLIP target genes were mapped to differential expression levels. It was found that NSP12 and NSP8 target mRNA levels are significantly increased than non-target genes (p ⁇ 10 -5 , KS test; FIG. 2D).
- a Gene Ontology analysis was perfomed. 54 GO processes that are significantly enriched by NSP12 target genes were found (P adjusted ⁇ 0.01) (Table 4), whereas no significant GO processes are found in NSP8 target genes.
- GO processes fall into three broad categories related to regulating transcription and gene expression, cell cycle and apoptosis, and phosphorylation and signaling processes (FIG.2E).
- transcription regulation genes many have antiviral response properties (e.g. NF- ⁇ B, BATF, NR4A1, BMP2, SQSTM1, MAFF, MDM2), while others have demonstrated proviral activities, such as DDX5, SFPQ, FBXW11 and ATF-3.
- PAK2 has been associated with anti-apoptotic signaling and promoting HIV survival, whereas many other genes have overlapping annotation as the transcription regulation genes.
- Example 3 Exogeneously expressed SARS-CoV-2 proteins interact with one third of the transcriptome in lung epithelial cells Even though NSP12, NSP8, and N eCLIPs were performed in virus infected vero cells, in order to further investigate whether SARS-COV-2 proteins directly interact with the human host transcriptome, eCLIP was performed on the 29 proteins encoded in the SARS-CoV-2 genome and one mutant (FIG.3A). Due to the lack of antibodies specific for most of the viral proteins, the individual proteins were exogenously expressed in a lung epithelial cell line BEAS-2B, which is an immortalized primary bronchial cell line representative of normal lung physiology.
- BEAS-2B is an immortalized primary bronchial cell line representative of normal lung physiology.
- Each protein was either fused with a 2xStrep tag and expressed stably via lentiviral transduction or fused with a 3xFLAG tag and expressed transiently via transfection. Following UV crosslinking, the tagged proteins were immunoprecipitated using anti-FLAG or anti-Strep antibodies. Subsequent RNA purification and library purification steps were performed as in the viral eCLIP experiments. Cells expressing only the 3xFLAG or 2xStrep tags and wildtype cells were used as controls to remove background peaks in subsequent analysis steps. From the SARS-CoV-2 proteome-wide eCLIP results, SARS-CoV-2 proteins interacted with RNA represented by 4773 coding genes, which is about a third of the transcriptome of BEAS-2B cells.
- Nucleocapsid and non-structural proteins NSP2, NSP3, NSP5, NSP9 and NSP12 were found to target the greatest number of unique genes at 1339, 1647, 1199, 902, 863, and 865, respectively (FIG. 3B).
- the large number of genes targeted by the viral proteins is consistent with the non-structural proteins from the replicase (ORF1ab) having a high affinity for its own RNA, though their potential for widespread interaction with host RNA has not been shown previously.
- the widespread interaction of Nucleocapsid with host RNAs when expressed in isolation is consistent with its capacity for nonspecific RNA binding, whereas its targeting the virus genome during RNA assembly occurs via interaction with the M protein.
- the extensively studied splicing factor RBFOX2 binds to 958 genes in HepG2 cells and 471 genes in K562 cells
- the stress granule assembly factor G3BP1 binds to 561 genes in HepG2 cells
- the histone RNA hairpin-binding protein SLBP binds to 19 genes in K562 (FIG.3B).
- viral proteins have similar capacities for interacting with RNA as endogenous human RBPs.
- Target genes with at least one significant eCLIP peak also appear highly distinct across the different SARS-CoV-2 proteins (FIG. 3C).
- eCLIP reads also display different profiles, example include N eCLIP peak in 3’ UTR of CXCL1, NSP3 peaks found across all exons in DYNCH1, a NSP12 peak in 5’ UTR of TUSC3 and a NSP2 peak in the intronic region upstream of 3’ splice site of NAP1L4 (FIG. 3D).
- N eCLIP peak in 3’ UTR of CXCL1 NSP3 peaks found across all exons in DYNCH1
- NSP12 peak in 5’ UTR of TUSC3 and a NSP2 peak in the intronic region upstream of 3’ splice site of NAP1L4 (FIG. 3D).
- SARS-CoV-2 proteins interact with host cell RNAs
- a subset of these proteins were validated as RBPs using crosslinking and solid phase purification (CLASP26), which stringently captures crosslinked protein-RNA interactions due to denaturing wash conditions.
- HEK293T cells transiently expressing NSP1, NSP2, NSP12, and ORF9c followed by pulldown of total RNA showed enrichment of these proteins (FIG. 3E).
- host RNA binding proteins ELAVL1, YTHDC1 and GAPDH were also included as positive controls, and tubulin as negative controls.
- RNA interactome capture (RIC27) of poly-A RNA (mRNA, lincRNA, and other POLII transcripts) pulldown was performed using an oligo (dT) primer and found that NSP2 and NSP12 were enriched, but not NSP1 (FIG.3F), which mostly enriched ribosomal RNAs in eCLIP.
- Distinct processes related to viral replication and host response are targeted by the viral proteins as shown by gene ontology (GO) analysis (FIG.3G).
- Many of the enriched GO terms are related to nucleic acid and protein synthesis, modification and transport, which is consistent with the primary objective of the virus hijacking host resources for its own biosynthesis and replication.
- several protein transport processes are enriched, namely SRP-dependent protein targeting to membrane as enriched by NSP6, ORF3a and N, and COPII vesicle budding and targeting from rough ER to Golgi as enriched by NSP12. These may be involved in viral vesicle formation to serve as replication organelles, as found in a number of positive sense RNA viruses.
- Immune response processes are also enriched, including neutrophil mediated immunity targeted by NSP12 and platelet degranulation targeted by ORF9c. This supports the choice of lung epithelial cells as a model system that express the relevant cytokines for recruiting immune cells. While the enriched GO terms are highly relevant to viral and host response processes, further analysis of binding patterns is needed to determine if there are any functional implications of viral proteins interacting with these genes. To determine if there are sequence features that the viral proteins recognize, sequence logos from 6-mers of eCLIP peaks were generated. While some of the proteins display strong sequence preferences, most proteins appear to interact more non-specifically.
- NSP5 has a motif (GNAUG).
- Other motifs may result from regional binding preferences (FIG.3H), as NSP2 and NSP9 have a strong preference for UC- rich polypyrimidine motifs (p values of 10 -96 and 10 -41 respectively), which may be a result of their binding to polypyrimidine tracts in intronic regions, whereas N has an AU-rich motif likely because it preferentially binds to 3’ UTR which contain AU-rich elements.
- NSP3 a large multifunctional protein, appears to coat entire exons and may not have a meaningful sequence motif.
- NSP12 primarily binds in the 5’ UTR, and a weakly enriched GUCCCG motif that resembles terminal oligopyrimidine (TOP) motifs hints at a possible role in translation perturbation.
- TOP terminal oligopyrimidine
- Example 4 SARS-CoV-2 proteins upregulate protein expression of target transcripts
- SARS-CoV-2 proteins are enriched at distinct regions of target mRNAs, which imply different regulatory functions because of the protein-RNA interaction. Aggregating the analysis of all targeted peaks for each SARS-CoV-2 protein identifies RNA regions that are preferentially bound (FIG.4A).
- NSP12, ORF3b, ORF7b and ORF9c show the highest proportion of peaks in the 5’ UTR; NSP2, NSP3, NSP6 and NSP14 show the highest proportion of peaks in the coding region (CDS), NSP5, NSP7 and NSP9 display a high proportion of peaks in intronic regions, and N and NSP15 show the largest proportion of peaks in the 3’ UTR.
- CDS coding region
- NSP5 NSP7 and NSP9 display a high proportion of peaks in intronic regions
- N and NSP15 show the largest proportion of peaks in the 3’ UTR.
- a metagene analysis of read density was performed across all target mRNA transcripts, where each of the 5’ UTR, CDS and 3’ UTR regions in an mRNA are scaled to standardized lengths (FIG.4B).
- NSP2 has a similar number and proportion of peaks in the CDS as NSP3, it mainly targets the region spanning the 5’ UTR and coding start.
- NSP3 reads, along with that of NSP6 and NSP14, coat the entire CDS, with a slight bias towards the start of the coding sequence.
- NSP3, NSP6, NSP12, NSP14, ORF3b, ORF7b and ORF9c - have binding preferences at the 5’ UTR and CDS, it was hypothesized that their protein-RNA interactions could affect expression of the target mRNAs at the level of RNA turnover or translation.
- Renilla luciferase activity normalized to firefly luciferase activity measures up- or downregulation of protein expression via either translation or mRNA stability because of positioning the MCP tagged protein in the vicinity of the Renilla mRNA.
- the luciferase readout does not by itself distinguish between translational or mRNA stabilizing effects. From the tethering experiments, it was found that the ratio of Renilla-MS2 to firefly luciferase for 9 of the 14 SARS-CoV-2 proteins increase 1.9 (NSP6) to 3.5-fold (ORF9c) relative to FLAG-MCP control (p-value ⁇ 0.002, two tailed multiple t-test) (FIG. 4D).
- these SARS-CoV-2 proteins raise the target luciferase activity to greater extent than the tethering of BOLL (1.5-fold), which is a human RBP previously characterized to be amongst the strongest up-regulators from a screen of more than 700 human RBPs.
- BOLL a human RBP previously characterized to be amongst the strongest up-regulators from a screen of more than 700 human RBPs.
- NSP5 NSP16 and N display slight (but not significant) down-regulation effects (0.73-fold to 0.58-fold) compared to the FLAG peptide control, but to a lesser extent than that of the known translation repressor CNOT7 (0.16-fold).
- NSP7 and NSP9 appear to have no effect on the relative luciferase activity of the target Renilla reporter.
- RT-qPCR was performed to measure the ratio of Renilla-MS2 to Firefly mRNAs.
- Ribophorin I is part of an N-oligosaccharyl transferase complex that links high mannose oligosaccharides to asparagine residues found in the Asn-X- Ser/Thr consensus motif of nascent polypeptide chains
- UDP-Glucose Glycoprotein Glucosyltransferase 1 is a soluble protein of the endoplasmic reticulum (ER) that selectively glucosylates unfolded glycoproteins.
- NDUFA4 is part of the enzyme cytochrome-c oxidase (or complex IV) and is important for its activity and biogenesis. Consistent with the data showing that exogenous expression of ORF9c can interact with RPNI, UGGT1 and NDUFA4 RNA and increase protein expression, it was found that SARS-CoV-2 infection increases RPNI, UGGT1 and NDUFA4 protein levels specifically in infected cells (FIGs.4H-4I).
- siRNA knockdown of these genes was investigated on viral infection or replication in human lung organoid cells (FIGs.6A-6E).
- Human lung organoids are a physiologically relevant system to study infections and have been shown to be highly infectible by SARS-CoV-2.
- siRNAs were selected from the target mRNAs of SARSCoV-2 proteins with mRNA stabilization or translation enhancing activities, in addition to an anti-ACE2 siRNA and a scrambled sequence as a negative control.
- Assays were performed for infected cells by immunofluorescence and infection rate was determined by measuring the total integrated fluorescence intensity of the stained nucleocapsid protein (FIG. 4K). To control for cell viability, the integrated intensity was divided to the area stained by DAPI, and the values were normalized to the scrambled control (FIG. 4L). It was found that siRNA knockdown of RPN1, UGGT1, NDUFA4, HSPA5, PSMD13, LAPTM4A, LAMP1, and LDHB significantly (p ⁇ 0.05, two-tailed t-test) reduced infection rates for at least one of the tested SARS-CoV-2 variants compared to a scrambled siRNA control (FIG.4L).
- siRNA knockdown of NDUFA4, UGGT1 and LAPTM4A significantly reduced viral infection in all three variants.
- SARS-CoV-2 proteins with a preference for binding to 5’ UTR and CDS regions have a capacity for increasing the abundance of target mRNAs and/or translation rates.
- eCLIP target genes are associated with enhanced RNA levels via NSP12 overexpression, and increased translation rates with ORF9c overexpression.
- Example 5 - NSP9 associates with the nuclear pore to block mRNA export Since it was reported that several SARS-CoV-2 proteins are localized to the cell nucleus, it was found that the eCLIP peaks of NSP2, NSP5, NSP7, and NSP9 are enriched in intronic regions (FIG.5A). To test whether these targets are implicated in infection induced alternative splicing, deep sequencing (>50 million 100 nt reads per sample) of SARS-CoV-2 infected A549-ACE2 cells was performed. A total of 1839 alternatively spliced genes were found across all five types of alternative splicing events i.e.
- each eCLIP read can be used to approximate the crosslink site where reverse transcription is aborted or truncated when converting protein-bound RNA to cDNA.
- a strong enrichment for the truncation site was observed at a median of 11 nt upstream of the 3’ splice site (FIG. 5C).
- NSP9 interacts with several nuclear pore complex proteins, including NUP62, NUP214, NUP88, NUP54 and NUP581 (FIG. 5D). It was confirmed that NUP62 indeed co-immunoprecipitated with NSP9.
- NSP9 may interfere with mRNA export by associating with the nuclear pore and interfering with the U2AF/NXF1 complex for RNA substrate recognition (FIG. 5E).
- the significance of NSP2, NSP5, and NSP7 association with this intronic region may be less clear, and will benefit from future studies for clarification.
- assays were performed for the mRNA levels of NSP9 target genes in cytosolic and nuclear fractions.
- NSP9 expressing BEAS-2B cells and the wild type BEAS-2B cells were fractionated, followed by RNA extraction and preparation for mRNA sequencing. No difference was observed in log2 fold changes of mRNA levels in NSP9 overexpressing cells versus wildtype cells between NSP9 eCLIP targets and non-targets, which agrees with the observation of lack of perturbation of target gene expression in the dual reporter assay. However, NSP9 eCLIP targets displayed greater log2 fold change of mRNA levels in the nuclear fraction and lower levels in the cytosolic fraction than non-target genes (FIG. 5E).
- Interleukin 1 ⁇ (IL-1 ⁇ ) and interleukin 1 ⁇ (IL-1 ⁇ ) are important inflammatory cytokines constitutively produced in epithelial cells and plays a central role in regulating immune responses, including being a master cytokine in acute lung inflammation.
- NSP9 inhibiting the nucleocytoplasmic export of the mRNA of IL-1 ⁇ has any impact on the production of this cytokine, an ELISA was performed on the growth media of BEAS-2B wild type and NSP9 expressing cells 48 hours after induction by several common cytokines.
- Interferon ⁇ , ⁇ and ⁇ resulted in lowered IL-1 ⁇ levels in NSP9 cells compared to wild type, though tumor necrosis factor alpha (TNF ⁇ ) resulted in the greatest reduction ( ⁇ 30%) (FIG. 5H).
- TNF ⁇ tumor necrosis factor alpha
- FIG. 5I The observation of reduced IL-1 ⁇ produced at different concentrations of TNF ⁇ (FIG.5I) was reproduced.
- reduced IL-1 ⁇ produced in NSP9 expressing cells than in wildtype BEAS-2B cells was observed (FIG. 5J).
- NSP9 association with the nuclear pore complex proteins aligns with the observation of decreased cytoplasmic abundance of NSP9 target mRNAs, suggesting that NSP9 interaction may directly inhibit nuclear export.
- NSP9 reduced the production of its target gene IL-1 ⁇ / ⁇ , which suggests that the export inhibition mechanism may be a strategy that SARSCoV-2 employs to dampen inflammatory host response.
- the export inhibition mechanism may be a strategy that SARSCoV-2 employs to dampen inflammatory host response.
- similarities in intronic protein-RNA interactions were observed by nonstructural proteins 2, 5, 7, and 9, which resembles the binding profile of splicing factor U2AF2.
- NSP9 reduces cellular mRNA export, likely by interfering with U2AF/NXF1 substrate recognition.
Abstract
Provided are compositions and methods of treating Coronavirus disease 2019 (COVID- 19) in a subject, the method including administering to the subject a therapeutically effective amount of a composition comprising an exogenous nucleic acid and delivering the exogenous nucleic acid into a cell, wherein the exogenous nucleic acid comprises an antisense oligonucleotide, a small interfering RNA (siRNA), or locked nucleic acid, and wherein the exogenous nucleic acid binds to a target RNA and modulates gene expression of the target RNA, thereby treating Coronavirus disease 2019 (COVID- 19) in the subject.
Description
COMPOSITIONS AND METHODS OF MODULATING RNA AND PROTEIN INTERACTIONS CROSS-REFERENCE TO RELATED APPLICATIONS This application claims priority to U.S. Provisional Patent Application No.63/196,005, filed on June 2, 2021. The disclosure of the prior application is considered part of the disclosure of this application, and is incorporated herein by reference in its entirety. SEQUENCE LISTING This application contains a Sequence Listing that has been submitted electronically as an ASCII text file named 156700353WO1_ST25. The ASCII text file, created on June 1, 2022, is 13.2 kilobytes in size. The material in the ASCII text file is hereby incorporated by reference in its entirety. BACKGROUND COVID-19 is caused by the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), a positive-sense single-stranded (+ss) RNA virus. The viral genome encodes 29 proteins, which include the four structural proteins, membrane or matrix (M), nucleocapsid (N), envelope (E), and spike (S) proteins. In addition, there are 16 non-structural proteins, NSP1-16, and 9 accessory proteins ORF3a-ORF10, though the expression of some of the accessory factors are still debated. Identification of conserved viral RNA processes, viral protein-host RNA interactions, and understanding how the virus hijacks these processes will enable the discovery of new antiviral targets and strategies. Recent transcriptome-wide and proteome-wide studies in viral protein-host protein interactions, viral protein and RNA interactions with host proteins, and viral RNA-host RNA interactions contribute to the understanding of host-virus interactions in SARS-CoV-2 infection. A recent study on the protein interactome with viral RNA shows that many of the SARS-CoV-2 proteins are RNA binding proteins that bind to its own RNA genome and mRNA transcripts. As a (+ss) RNA virus, SARS-CoV-2 proteins are also found to associate with several host RNA binding proteins (RBPs), suggesting a possibility that SARS-CoV-2 proteins interact with the host transcriptome to a greater degree than previously anticipated. For example, it has been shown that NSP1 binds to the mRNA entry site on the host ribosomal RNA to inhibit host translation. SARS-CoV-2 nucleocapsid protein interactome comprises many host RNA processing machinery proteins and stress granule proteins, suggesting a
potential role in interfering with host RNA processing and driving stress granule formation. One study showed that SARS-CoV-2 proteins bind to ~140 host transcripts. Focusing on non- coding RNAs, SARSCoV-2 NSP16 was found to bind to U1/U2 snRNA to interfere with splicing, while NSP8 and NSP9 bind to the signal recognition ribonucleoprotein 7SK to block protein trafficking. However, knowledge of how viral and host coding RNAs interact with viral proteins and the functional implications of these interactions remain limited. Thus, a comprehensive interrogation of SARS-CoV-2 viral protein-RNA interactions is still needed to gain insights into viral RNA processing and how the virus hijacks host cellular machinery for its replication while simultaneously suppressing host gene expression. SUMMARY Provided herein are pharmaceutical compositions comprising an inhibitory nucleic acid that is complementary to a portion of a target RNA. In some embodiments, the target RNA comprises a portion of the SARS-CoV-2 genome. In some embodiments, the target RNA comprises a portion of human RNA. In some embodiments, the target RNA comprises human mRNA. In some embodiments, the target RNA is associated with a portion of the 5’ untranslated region (UTR) of the target RNA. In some embodiments, the target RNA is associated with a portion of the coding region of the target RNA. In some embodiments, the target RNA is associated with a portion of the 3’ untranslated region of the target RNA. In some embodiments, a SARS-CoV-2 protein selected from the group consisting of NSP2, NSP3, NSP5, NSP6, NSP7, NSP9, NSP12, NSP14, NSP15, ORF3b, ORF7b and ORF9c is capable of binding to a portion of the target RNA. In some embodiments, the target RNA is associated with a gene selected from the group consisting of RPN1, UGGT1, OST4, NDUFA4, HSPA5, PSMD13, LAPTM4A, LAMP1, C1QBP, and LDHB. In some embodiments, the inhibitory nucleic acid comprises one or more small interfering RNAs (siRNA). In some embodiments, the one or more siRNAs comprise at least one sequence selected from the group consisting of SEQ ID NOs: 1-40. In some embodiments, the inhibitory nucleic acid comprises one or more antisense oligonucleotides. In some embodiments, the inhibitory nucleic acid comprises one or more locked nucleic acids (LNA). In some embodiments, the one or more LNAs comprises a sequence selected from the group consisting of SEQ ID NOs: 52 and 53. In some embodiments, the inhibitory nucleic acid comprises one or more phosphorothioate substitutions. In some embodiments, the inhibitory nucleic acid comprises one or more 2’-O-methyl modifications. In some embodiments, the inhibitory nucleic acid comprises one or more 2’-O-methoxyethyl
modifications. In some embodiments, the inhibitory nucleic acid comprises one or more 2 - fluoro modifications. In some embodiments, the one or more siRNAs, antisense oligonucleotides, or locked nucleic acids, are a plurality, wherein each siRNA, antisense oligonucleotide, or locked nucleic acid comprises the same sequence as the other siRNAs, antisense oligonucleotides, or locked nucleic acids in the plurality. In some embodiments, the one or more siRNAs, antisense oligonucleotides, or locked nucleic acids contain a mixture of siRNAs, antisense oligonucleotides, or locked nucleic acids comprising at least two different sequences. In some embodiments, the LNA is enclosed within a vector. In some embodiments, the vector is a lentivirus. In some embodiments, the vector is an adenovirus. In some embodiments, the vector is an adeno-associated viral vector (AAV). In some embodiments, the vector is a liposome. Also provided herein are methods of treating Coronavirus disease 2019 (COVID-19) in a subject, the method comprising: administering to the subject a therapeutically effective amount of any one of the pharmaceutical compositions described herein, thereby treating Coronavirus disease 2019 (COVID-19) in the subject. In some embodiments, the therapeutically effective amount of the pharmaceutical composition decreases target RNA protein expression. Also provided herein are methods of reducing risk of SARS-CoV2 infection in a subject, the method comprising: administering to the subject a therapeutically effective amount of any one of the pharmaceutical compositions described herein, thereby reducing risk of SARS-CoV2 infection in the subject. Also provided herein are inhibitory nucleic acids which is or comprises SEQ ID NOs: 1-40, wherein the inhibitory nucleic acid comprises a siRNA. Also provided herein are inhibitory nucleic acids which is or comprises SEQ ID NOs: 52 or 53, wherein the inhibitory nucleic acid comprises a LNA. BRIEF DESCRIPTION OF DRAWINGS FIG. 1A is an exemplary schematic showing eCLIP performed on SARS-CoV-2 proteins in virus infected Vero E6 cells. Proteins in infected cells were UV crosslinked to bound transcripts, which were immunoprecipitated (IP) with antibodies that recognize NSP8 (primase), NSP12
(RNA dependent RNA polymerase, RdRp) and N (nucleocapsid) proteins. Protein-RNA IP product and Input lysate were resolved by SDS-PAGE and membrane transferred, followed by band excision at the estimated protein size to 75kDa above in both IP and Input lanes. Excised bands were subsequently purified, and library barcoded for Illumina sequencing. FIG.1B shows mean fold change of eCLIP read density mapped to the positive sense SARS- CoV-2 genome in immunoprecipitated (IP) compared to input samples. Mean was taken from n = 2 independent biological samples. FIG. 1C shows NSP12 eCLIP zoomed into yellow highlighted regions in FIG.1B. Top row, NSP12 eCLIP; bottom row, SHAPE Shannon entropy with a sliding median of 55 nt. Shaded region in bottom row is partitioned at the global median entropy. FIG. 1D shows correlation between normalized SHAPE entropy and normalized log2 (Fold Change) of IP over INPUT eCLIP read density i.e. eCLIP enrichment for NSP12 (left), NSP8 (middle) and N (right). R, Pearson’s coefficient. FIG.1E shows secondary structure of the NSP12 eCLIP peak region from position 7412-7545. FIG. 1F shows results from filter-binding assay showing RNA crosslinked to NSP12 on the nitrocellulose layer, and the free RNA on the nylon layer. FIG.1G shows fraction of RNA bound to NSP12 from filter binding assay, using hairpin RNA from position 7414-7555 and scrambled RNA as negative control. FIG. 1H shows correlation matrix of mean fold change of eCLIP read density: Bottom left panels, 2D density plots; diagonal, density plot corresponding to samples in bottom labels; top right panels, Pearson’s coefficient between samples. FIG.1I shows a locked nucleic acid (LNA) designed to be complementary to the hairpin RNA from position 7414-7555. FIG.1J shows complementary LNA targeting SARS-CoV-2 genome shown to hybridize and disrupt the structure of the central hairpin as predicted by NUPACK software. FIG.1K shows immunofluorescence microscopy images showing fraction of infected cells as stained by SARS-CoV-2 nucleocapsid proteins (green), with cell nucleus stained with DAPI (green). FIG. 1L shows a bar plot showing mean relative fluorescence intensities of cells from FIG. 1K, dots represent segmented individual cells (mean ± s.e.m. * p<0.01, two-tailed t-test). FIG.2A shows a bar plot showing number of all genes, number of all peaks, number of coding genes and number of peaks mapping to coding genes from n = 2 biologically independent
replicates of NSP12, NSP8 and N eCLIP of SARS-CoV-2 infected cells. Target genes have at least one reproducible peak (by IDR12) associated with each protein. FIG. 2B shows a stacked bar plot showing TPM of reads mapped to the Vero E6 genome or SARS-CoV-2 genome in each of NSP12, NSP8 and N eCLIP. FIG. 2C shows a Venn diagram showing number of African Green Monkey (host) genes targeted by NSP8 and NSP12. FIG.2D shows a Violin plot showing the distribution of Log2FoldChange in transcript levels in Vero E6 cells infected by SARS-CoV-2, for significantly differentially expressed genes (adjusted P < 0.05). Kolmogorov–Smirnov test p-values between eCLIP targets of NSP12 and NSP8 versus all differentially expressed genes are indicated above the plot. FIG.2E shows Top 25 Enriched Gene Ontology (GO) processes (adjusted p < 0.01) for NSP12 target host genes. Box plot indicates quartiles of differential expression (log2(Fold Change)) of target genes (grey dots). FIG. 3A shows an exemplary schematic showing SARS-CoV-2 proteins individually tagged and expressed in human lung epithelial cells BEAS-2B to assay with eCLIP. FIG. 3B shows a bar plot indicating number of all genes, number of all peaks, number of coding genes and number of coding peaks found to interact with each protein from n = 2 biologically independent experiments. In addition to SARS-CoV-2 proteins, ENCODE eCLIP data for example human RNA-binding proteins (hRBPs) are included for comparison. Target genes have at least one reproducible peak (by IDR) associated with each protein. FIG. 3C shows a clustermap showing unique host coding genes (columns) targeted by each SARS-CoV-2 protein (rows). FIG. 3D shows example genome browser tracks for NSP3, NSP12, N and NSP2 mapping to DYNCH1, TUSC3, CXCL5 and NAP1L4 respectively. FIGs. 3E-3F show Western blots showing viral (pink background) and human (blue background) proteins enriched via CLASP (FIG.3E) and RIC (FIG.3F), with total cell lysate showed in input column (IN). FIG.3G shows Enriched Gene Ontology (GO) processes (adjusted p-value < 10-5) of unique eCLIP target coding genes for various SARS-CoV-2 proteins. FIG. 3H shows example sequence logos generated from all IDR peak reads for each SARS- CoV-2 eCLIP, with p-value indicated above each logo. FIG.4A shows a stacked bar plot showing fraction of reproducible peaks (by IDR14) mapping to different regions of coding genes.3ss, 3’ splice site; 3utr, 3’ untranslated region (UTR), 5ss, 5’ splice site; 5utr, 5’ UTR; CDS, coding sequence.
FIG.4B shows a clustermap showing read density of target RNA by each SARS-CoV-2 protein scaled to a metagene profile containing 5’ UTR, CDS and 3’ UTR regions. FIG.4C shows an exemplary schematic showing the Renilla-MS2 and Firefly dual luciferase reporter constructs, where individual SARS-CoV-2 proteins fused to MCP are recruited to the Renillia-MS2 mRNA. FIGs.4D-4E show bar plots showing luciferase reporter activity ratios (FIG.4D) and reporter RT-qPCR ratios (FIG.4E) for the indicated coexpressed SARS-CoV-2 protein, known human regulators of RNA stability (CNOT7, BOLL) and negative control (FLAG peptide). Ratios are normalized to the negative control (mean ± s.e.m., n = 3 biologically independent replicate transfections; * p<0.05, ** p<0.005, *** p<0.0005, **** p<0.0001, two-tailed multiple t-test; ns, not significant). FIG. 4F shows a bar plot showing the fold change of luciferase activity ratio and RT-qPCR ratio (mean ± s.e.m, n = 3; * p<0.05, *** p<0.001, two-tailed Welch’s t-test). FIG.4G shows a cumulative distributive plot (CDF) of log2(Fold Change) of gene expression in HEK293T cells transfected with a plasmid overexpressing NSP12 versus an empty vector plasmid. KS test p values indicate significance of difference in differential expression of NSP12 target genes versus non-eCLIP target genes. FIG.4H shows Enriched Gene Ontology (GO) processes (adjusted p < 10-4) of NSP12 target genes, with box plots indicating quartiles of differential expression (log2(Fold Change)) of target genes (black dots). FIG. 4I shows a CDF plot of Δlog2(Fold Change) of polysomal mRNA levels in BEAS-2B cells nucleofected with a plasmid overexpressing ORF9c versus an empty vector plasmid. KS test p values indicate significance of difference in differential expression of ORF9c target genes versus non-eCLIP target genes. FIG.4J shows Enriched BioPlanet pathways (adjusted p < 0.01) of ORF9c target genes, with box plots indicating quartiles of differential expression (Δlog2(Fold Change) of polysomal mRNA levels) of target genes (black dots). FIG. 4K shows immunofluorescence images (40X) of SARS-CoV-2 infected A549-ACE2 cells stained for SARS-CoV-2 NSP8 (red), endogenous genes (green), DNA content (blue). FIG. 4L shows a heat map showing infection rate as measured by the integrated intensity of immunofluorescence staining of SARS-CoV-2 nucleocapsid protein in human iPSC derived lung organoid cells. Cells were treated with siRNAs targeting different host genes prior to viral infection by three different variants of SARS-CoV-2. Significant differences in infection rates
were given by two-tailed t-test, * p<0.05, **p<0.01, ns, not significant, as compared to scrambled siRNA control for n = 3 biologically independent samples. FIG.5A shows a pie charts showing distribution of eCLIP peaks across different coding RNA regions for NSP2, NSP5, NSP7 and NSP9. Genomic content and exonic content are based on the hg19 human reference genome. FIG. 5B shows Jaccard index similarity of NSP9 target genes as compared with all 223 ENCODE RBP datasets. FIG. 5C shows metadensity of eCLIP reads truncation sites averaged across all RNA targets by SARS-CoV-2 NSP2, NSP5, NSP7 and NSP9, and U2AF1/2 from the ENCODE consortium, zoomed into the region 150 nt upstream of 3’ splice sites, and the region 150 nt downstream of the 5’ end of the last exon. FIG.5D shows an exemplary schematic illustrating a model of NSP9 interacting with nuclear pore complex proteins NUP62, NUP214, NUP58, NUP88 and NUP54, and inhibiting U2AF2 substrate recognition in preventing NXF1 facilitated transport. FIG. 5E shows cumulative distributive plot (CDF) of log2(Fold Change) of BEAS-2B cells overexpressing NSP9 versus wildtype BEAS-2B cells in each of nuclear, cytosolic, and total mRNA fractions. Solid line indicate NSP9 target genes, dashed lines indicate genes that are not NSP9 targets. FIG. 5F shows genome browser tracks of NSP9 eCLIP target RNA mapped to IL-1α, IL-1β, ANXA2 and UPP1. FIG. 5G shows a bar plot showing ratios of cytosolic to total fraction of mRNA levels measured by RTqPCR, in wild type (WT) BEAS-2B cells, and BEAS-2B cells transduced to express NSP9 (*p<0.05, **p<0.0005, two-tailed multiple t-test with pooled variance, n = 2 biologically independent replicates). FIG.5H shows a bar plot showing mean concentration of IL-1α in culture media from WT and NSP9 expressing BEAS-2B cells, 48h after induction by cytokines indicated on the x-axis (US, unstimulated; mean ± s.e.m, n = 3 biologically independent replicates; *p < 0.05, Tukey’s multiple comparisons test). FIG.5I shows a bar plot showing mean concentration of IL-1α in culture media from WT and NSP9 expressing BEAS-2B cells, 48h after induction by different levels of TNFα (mean ± s.e.m, n = 3 biologically independent replicates, *p<0.05, **p<0.005, two-tailed t-test). FIG.5J shows a bar plot showing mean concentration of IL-1β in culture media from WT and NSP9 expressing BEAS-2B cells, 48h after induction by 0 or 100 ng/ml TNFα (mean ± s.e.m, n = 3 biologically independent replicates, *p<0.05, **p<0.005, two-tailed t-test).
FIG. 6A shows an exemplary schematic illustrating the complex host-viral relationship. Flat- ended arrows indicate downregulation, pointed arrows indicate upregulation. Blue arrows are newly proposed interactions. FIG.6B shows an exemplary schematic illustrating the hypothesis that siRNA knockdown of host genes upregulated by SARS-CoV-2 proteins would result in reduced virus proliferation if the genes are pro-viral. FIG. 6C shows an exemplary schematic outlining the workflow of testing the antiviral effect of siRNAs targeting candidate pro-viral genes. siRNAs are transfected at 25 nM. Cells are infected at MOI of 3 for A549-ACE2 cells and MOI of 1 for lung organoid cells. FIGs. 6D-6E show bar plots showing the infection rate (mean fluorescence intensity of nucleocapsid divided by cell count) by normalized to the scrambled siRNA control in A549- ACE2 cells (FIG.6D) and lung organoid cells (FIG.6E). Cells were treated with siRNAs that target host genes, as indicated in the x-axis (mean ^ s.e.m, n = 3 biologically independent replicates; * p ^0.05, ** p ^0.005, two-tailed Welch’s t-test; ns, not significant). FIG. 7A shows secondary structures that overlap with the major peaks enriched in NSP12 eCLIP at positions 3533-3635, 17202-17222, 21177-21206, and 24018-24079. FIG.7B shows results from a filter-binding assay showing RNA crosslinked to NSP12 on the nitrocellulose layer, and the free RNA on the nylon layer. FIG. 7C shows multiple sequence alignment of the region corresponding among the SARS- CoV-2 reference sequence, and homologous bat and pangolin sequences. Consensus sequence and structure are indicated underneath the aligned sequences. FIG.7D shows a phylogenetic tree constructed from FIG.7C. FIG. 7E shows a RNA-seq read density plot from SARS-CoV-2 infected A549-ACE2 cells mapping sequenced reads to the sense strand of the SARS-CoV-2 genome. DETAILED DESCRIPTION Detailed herein are methods of modulating gene expression of a target RNA that include delivering an exogenous nucleic acid into a cell, wherein the exogenous nucleic acid comprises an antisense oligonucleotide or a small interfering RNA (siRNA), and wherein the exogenous nucleic acid binds to the target RNA, thereby modulating gene expression of the target RNA in the cell.
Various non-limiting aspects of these methods are described herein, and can be used in any combination without limitation. Additional aspects of various components of methods for modulating gene expression are known in the art. It must be noted that, as used in the specification and the appended claims, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise. As used herein, the terms “about” and “approximately,” when used to modify an amount specified in a numeric value or range, indicate that the numeric value as well as reasonable deviations from the value known to the skilled person in the art, for example ± 20%, ± 10%, or ± 5%, are within the intended meaning of the recited value. As used herein, a “cell” can refer to either a prokaryotic or eukaryotic cell, optionally obtained from a subject or a commercially available source. As used herein, “delivering”, “gene delivery”, “gene transfer”, “transducing” can refer to the introduction of an exogenous polynucleotide into a host cell, irrespective of the method used for the introduction. Such methods include a variety of well-known techniques such as vector-mediated gene transfer (e.g., viral infection/transfection, or various other protein-based or lipid-based gene delivery complexes) as well as techniques facilitating the delivery of “naked” polynucleotides (e.g., electroporation, “gene gun” delivery and various other techniques used for the introduction of polynucleotides). The introduced polynucleotide may be stably or transiently maintained in the host cell. Stable maintenance typically requires that the introduced polynucleotide either contains an origin of replication compatible with the host cell or integrates into a replicon of the host cell such as an extrachromosomal replicon (e.g., a plasmid) or a nuclear or mitochondrial chromosome. In some embodiments, a polynucleotide can be inserted into a host cell by a gene delivery molecule. Examples of gene delivery molecules can include, but are not limited to, liposomes, micelles biocompatible polymers, including natural polymers and synthetic polymers; lipoproteins; polypeptides; polysaccharides; lipopolysaccharides; artificial viral envelopes; metal particles; and bacteria, or viruses, such as baculovirus, adenovirus and retrovirus, bacteriophage, cosmid, plasmid, fungal vectors and other recombination vehicles typically used in the art which have been described for expression in a variety of eukaryotic and prokaryotic hosts, and may be used for gene therapy as well as for simple protein expression. As used herein, the term “encode” as it is applied to nucleic acid sequences refers to a polynucleotide which is said to “encode” a polypeptide if, in its native state or when manipulated by methods well known to those skilled in the art, can be transcribed and/or
translated to produce the mRNA for the polypeptide and/or a fragment thereof. The antisense strand is the complement of such a nucleic acid, and the encoding sequence can be deduced therefrom. As used herein, the term “exogenous” refers to any material introduced from or originating from outside a cell, a tissue or an organism that is not produced by or does not originate from the same cell, tissue, or organism in which it is being introduced. As used herein, the term “expression” refers to the process by which polynucleotides are transcribed into mRNA and/or the process by which the transcribed mRNA is subsequently translated into peptides, polypeptides, or proteins. In some embodiments, if the polynucleotide is derived from genomic DNA, expression may include splicing of the mRNA in a eukaryotic cell. The expression level of a gene may be determined by measuring the amount of mRNA or protein in a cell or tissue sample; further, the expression level of multiple genes can be determined to establish an expression profile for a particular sample. As used herein, “nucleic acid” is used to include any compound and/or substance that comprise a polymer of nucleotides. In some embodiments, a polymer of nucleotides are referred to as polynucleotides. Exemplary nucleic acids or polynucleotides can include, but are not limited to, ribonucleic acids (RNAs), deoxyribonucleic acids (DNAs), threose nucleic acids (TNAs), glycol nucleic acids (GNAs), peptide nucleic acids (PNAs), locked nucleic acids (LNAs, including LNA having a β-D-ribo configuration, α-LNA having an ^-L-ribo configuration (a diastereomer of LNA), 2’-amino-LNA having a 2’-amino functionalization, and 2’-amino- ^-LNA having a 2’-amino functionalization) or hybrids thereof. Naturally- occurring nucleic acids generally have a deoxyribose sugar (e.g., found in deoxyribonucleic acid (DNA)) or a ribose sugar (e.g., found in ribonucleic acid (RNA)). A nucleic acid can contain nucleotides having any of a variety of analogs of these sugar moieties that are known in the art. A deoxyribonucleic acid (DNA) can have one or more bases selected from the group consisting of adenine (A), thymine (T), cytosine (C), or guanine (G), and a ribonucleic acid (RNA) can have one or more bases selected from the group consisting of uracil (U), adenine (A), cytosine (C), or guanine (G). In some embodiments, the term “nucleic acid” refers to a deoxyribonucleic acid (DNA) or ribonucleic acid (RNA), or a combination thereof, in either a single- or double-stranded form. Unless specifically limited, the term encompasses nucleic acids containing known analogues of natural nucleotides that have similar binding properties as the reference nucleotides. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses
complementary sequences as well as the sequence explicitly indicated. In some embodiments of any of the isolated nucleic acids described herein, the isolated nucleic acid is DNA. In some embodiments of any of the isolated nucleic acids described herein, the isolated nucleic acid is RNA. Modifications can be introduced into a nucleotide sequence by standard techniques known in the art, such as site-directed mutagenesis and polymerase chain reaction (PCR)-mediated mutagenesis. Conservative amino acid substitutions are ones in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art. These families include amino acids with basic side chains (e.g., arginine, lysine and histidine), acidic side chains (e.g., aspartic acid and glutamic acid), uncharged polar side chains (e.g., asparagine, cysteine, glutamine, glycine, serine, threonine, tyrosine, and tryptophan), nonpolar side chains (e.g., alanine, isoleucine, leucine, methionine, phenylalanine, proline, and valine), beta-branched side chains (e.g., isoleucine, threonine, and valine), and aromatic side chains (e.g., histidine, phenylalanine, tryptophan, and tyrosine), and aromatic side chains (e.g., histidine, phenylalanine, tryptophan, and tyrosine). Unless otherwise specified, a “nucleotide sequence encoding a protein” includes all nucleotide sequences that are degenerate versions of each other and thus encode the same amino acid sequence. As used herein, the term “plurality” can refer to a state of having a plural (e.g., more than one) number of different types of things (e.g., a cell, a genomic sequence, a subject, a system, or a protein). In some embodiments, a plurality of genomic sequences can be more than one genomic sequence wherein each genomic sequence is different from each other. As used herein, the term “subject” is intended to include any mammal. In some embodiments, the subject is cat, a dog, a goat, a human, a non-human primate, a rodent (e.g., a mouse or a rat), a pig, or a sheep. As used herein, the term “transduced”, “transfected”, or “transformed” refers to a process by which exogenous nucleic acid is introduced or transferred into a cell. A “transduced,” “transfected,” or “transformed” mammalian cell is one that has been transduced, transfected or transformed with exogenous nucleic acid (e.g., a gene delivery vector) that includes an exogenous nucleic acid encoding RNA-binding zinc finger domain). As used herein, the term “treating” means a reduction in the number, frequency, severity, or duration of one or more (e.g., two, three, four, five, or six) symptoms of a disease or disorder in a subject (e.g., any of the subjects described herein), and/or results in a decrease
in the development and/or worsening of one or more symptoms of a disease or disorder in a subject. Inhibitory Nucleic Acids Disclosed herein are inhibitory nucleic acids which block the interaction between a host and SARS-CoV2-associated RNA or protein. In some embodiments, the inhibitory nucleic acids are oligonucleotides. In some embodiments, the inhibitory nucleic acids are antisense oligonucleotides. In some embodiments, the inhibitory nucleic acids are modified oligonucleotides. In some embodiments, the inhibitory nucleic acids are locked nucleic acids (LNAs). In some embodiments, the modified oligonucleotides are antisense oligonucleotides that hybridize to viral RNA. In some embodiments, the inhibitory nucleic acids are designed to target a specific region of viral RNA. For example, a specific functional region can be targeted. In some embodiments, the inhibitory nucleic acids hybridize to a viral genome. In some embodiments, the inhibitory nucleic acids are complementary to and/or hybridize to SARS-CoV2. In some embodiments, the inhibitory nucleic acids can be used in model systems, e.g., lung organoids. In some embodiments, the inhibitory nucleic acids are complementary to and/or hybridize and/or bind to host RNA. In some embodiments, the inhibitory nucleic acids hybridize to the polyA tail of host mRNA. In some embodiments, the inhibitory nucleic acids can be used to treat viral infections in a host. In some embodiments, the inhibitory nucleic acids can be used in pharmaceutical compositions to achieve a therapeutic effect, e.g., reducing or alleviating the symptoms of a disease in a patient. In some embodiments, the disease is a viral infection. In some embodiments, the therapeutic effect includes a reduction or elimination of symptoms including fever or chills, cough, shortness of breath or difficulty breathing, fatigue, muscle or body aches, headache, new loss of taste or smell, sore throat, congesting or runny nose, nausea or vomiting, diarrhea. In some embodiments, the pharmaceutical composition comprises an inhibitory nucleic acid, e.g., an antisense oligonucleotide that is complementary to one or more viral RNAs, which may cause the disease in the patient. In some embodiments, the inhibitory nucleic acid may be complementary to one or more host RNAs. In some embodiments, as described in further detail below, the antisense oligonucleotide includes different modifications, e.g., in the sugar backbone to make it more cell permeable and nuclease resistant and physiologically non-toxic at low concentrations.
Inhibitory nucleic acids useful in the present methods and compositions include antisense oligonucleotides, modified bases/locked nucleic acids (LNAs), antagomirs, peptide nucleic acids (PNAs), double stranded RNA species, siRNAs, morpholinos, and other oligomeric compounds or oligonucleotide mimetics which hybridize to at least a portion of the target nucleic acid (i.e., an viral RNA gene, e.g., a SARS-CoV2 gene) and modulate its abundance, splicing, post-transcriptional processing, or translation; see, e.g., U.S. Patent Nos. 9,045,749 and 9,476,046. In some embodiments, the inhibitory nucleic acids include antisense RNA, antisense DNA, chimeric antisense oligonucleotides, antisense oligonucleotides comprising modified linkages, or combinations thereof. See, e.g., WO 2010040112. In some embodiments, the inhibitory nucleic acids are LNAs. A locked nucleic acid is a nucleotide having a modified ribose moiety in which the ribose moiety comprises an extra bridge connecting the 2′ and 4′ carbons. This structure effectively “locks” the ribose in the 3′- endo structural conformation. The addition of locked nucleic acids to siRNAs has been shown to increase siRNA stability in serum, and to reduce off-target effects (Elmen, J. et al., (2005) Nucleic Acids Research 33(1):439-447; Mook, O R. et al., (2007) Mol Canc Ther 6(3):833- 843; Grunweller, A. et al., (2003) Nucleic Acids Research 31(12):3185-3193). In some embodiments, LNA molecules can include molecules comprising 10-30, e.g., 12-24, e.g., 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides in each strand, wherein one of the strands is substantially identical, e.g., at least 80% (or more, e.g., 85%, 90%, 95%, or 100%) identical, e.g., having 3, 2, 1, or 0 mismatched nucleotide(s), to a SARS-CoV2 sequence. The LNA molecules can be chemically synthesized using methods known in the art. The LNA molecules can be designed using any method known in the art; a number of algorithms are known, and are commercially available (e.g., on the internet, for example at exiqon.com). See, e.g., You et al., Nuc. Acids. Res. 34:e60 (2006); McTigue et al., Biochemistry 43:5388-405 (2004); and Levin et al., Nuc. Acids. Res. 34:e142 (2006). For example, “gene walk” methods, similar to those used to design antisense oligos, can be used to optimize the inhibitory activity of the LNA (or any other inhibitory nucleic acid described herein); for example, a series of oligonucleotides of 10-30 nucleotides spanning the length of a target SARS-CoV2 sequence can be prepared, followed by testing for activity. Optionally, gaps, e.g., of 5-10 nucleotides or more, can be left between the LNAs to reduce the number of oligonucleotides synthesized and tested. GC content is preferably between about 30 and 60%. General guidelines for designing LNAs are known in the art; for example, LNA sequences will bind very tightly to other LNA sequences, so it is preferable to avoid significant
complementarity within an LNA. Contiguous runs of three or more Gs or Cs, or more than four LNA residues, should be avoided where possible (for example, it may not be possible with very short (e.g., about 9-10 nt) oligonucleotides). In some embodiments, the LNAs are xylo- LNAs. For additional information regarding LNAs see U.S. Pat. Nos. 6,268,490; 6,734,291; 6,770,748; 6,794,499; 7,034,133; 7,053,207; 7,060,809; 7,084,125; and 7,572,582; and U.S. Pre-Grant Pub. Nos. 20100267018; 20100261175; and 20100035968; Koshkin et al. Tetrahedron 54, 3607–3630 (1998); Obika et al. Tetrahedron Lett. 39, 5401–5404 (1998); Jepsen et al., Oligonucleotides 14:130–146 (2004); Kauppinen et al., Drug Disc. Today 2(3):287-290 (2005); and Ponting et al., Cell 136(4):629–641 (2009), and references cited therein. In some embodiments, the inhibitory nucleic acids are siRNA. The term “siRNA” refers to small inhibitory RNA duplexes that induce the RNA interference (RNAi) pathway. As used herein, a “small interfering RNA” (siRNA) sometimes known as “short interfering RNA” or “silencing RNA”, refers to a class of double-stranded RNA (e.g., non-coding RNA molecules), typically 18-30 base pairs, 20-24 base pairs, (e.g., 20 base pairs, 21 base pairs, 22 base pairs, 23 base pairs, or 24 base pairs) in length, and operating within the RNA interference (RNAi) pathway. RNA interference can offer a therapeutic approach for treating viral infections, wherein small interfering RNAs (siRNAs) can be used to target a viral sequence and inhibit (i.e., reduce or eliminate) viral gene expression, and also for treatment of a viral infection in a mammal, such as a human. In some embodiments, the small interfering RNA constructs can inhibit gene expression of a virus by inducing cleavage of viral polynucleotide sequences within or near the target RNA sequence that is recognized by the antisense sequence of the small interfering RNA. In some embodiments, siRNAs interfere with the expression of specific genes with complementary nucleotide sequences by degrading mRNA after transcription, preventing translation. These small inhibitory RNA molecules can vary in length (generally 18-30 base pairs) and contain varying degrees of complementarity to their target mRNA in the antisense strand. Some, but not all, siRNA have unpaired overhanging bases on the 5′ or 3′ end of the sense strand and/or the antisense strand. The term “siRNA” includes duplexes of two separate strands, as well as single strands that can form hairpin structures comprising a duplex region. In some embodiments, a small interfering RNA can refer to an RNA construct that contains one or more short sequences that are at least partially complementary to and can interact with a polynucleotide sequence of a virus (e.g., viral RNA). Interaction may be in the
form of a direct binding between complementary (antisense) sequences of the small interfering RNA and polynucleotide sequences of the viral target, or in the form of an indirect interaction via enzymatic machinery (e.g., a protein complex) that allows the antisense sequence of the small interfering RNA to recognize the target sequence. In some embodiments, recognition of the target sequence by the small interfering RNA results in cleavage of viral sequences within or near the target site that is recognized by the recognition (antisense) sequence of the small interfering RNA. The small interfering RNA can exclusively contain ribonucleotide residues, or the small interfering RNA can contain one or more modified residues, particularly at the ends of the small interfering RNA or on the sense strand of the small interfering RNA. In some embodiments, small interfering RNA encompasses shRNA and siRNA, both of which are understood and known to those in the art to refer to RNA constructs with particular characteristics and types of configurations. In some embodiments, the inhibitory nucleic acids are antisense oligonucleotides. Antisense oligonucleotides are typically designed to inhibit expression of a DNA or RNA target by binding to the target and halting expression at the level of transcription, translation, or splicing. Antisense oligonucleotides of the present compositions and methods are complementary nucleic acid sequences designed to a target viral RNA to achieve a therapeutic effect in a patient. Thus, inhibitory nucleic acids and/or oligonucleotides are chosen that are sufficiently complementary to the target, i.e., that hybridize sufficiently well and with sufficient specificity, to give the desired therapeutic effect. In some embodiments, the inhibitory nucleic acids are 9 to 50, 9 to 21, 13 to 50, or 13 to 30 nucleotides in length. One having ordinary skill in the art will appreciate that this embodies oligonucleotides having antisense portions of 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 nucleotides in length, or any range therein. In some embodiments, the oligonucleotides are 15 nucleotides in length. In some embodiments, the antisense or oligonucleotide compounds of the invention are 12 or 13 to 30 nucleotides in length. One having ordinary skill in the art will appreciate that this embodies inhibitory nucleic acids having antisense portions of 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29 or 30 nucleotides in length, or any range therein. In some embodiments, the inhibitory nucleic acids are chimeric oligonucleotides that contain two or more chemically distinct regions, each made up of at least one nucleotide. These oligonucleotides typically contain at least one region of modified nucleotides that confers one or more beneficial properties (such as, for example, increased nuclease resistance, increased
uptake into cells, increased binding affinity for the target) and a region that is a substrate for enzymes capable of cleaving RNA:DNA or RNA:RNA hybrids. Chimeric inhibitory nucleic acids of the invention may be formed as composite structures of two or more oligonucleotides, modified oligonucleotides, oligonucleosides and/or oligonucleotide mimetics as described above. Such compounds have also been referred to in the art as hybrids or gapmers. In some embodiments, the oligonucleotide is a gapmer (contain a central stretch (gap) of DNA monomers sufficiently long to induce RNase H cleavage, flanked by blocks of LNA modified nucleotides; see, e.g., Stanton et al., Nucleic Acid Ther. 2012. 22: 344-359; Nowotny et al., Cell, 121:1005–1016, 2005; Kurreck, European Journal of Biochemistry 270:1628-1644, 2003; Fluiter et al., Mol Biosyst.5(8):838-43, 2009). In some embodiments, the oligonucleotide is a mixmer (includes alternating short stretches of LNA and DNA; Naguibneva et al., Biomed Pharmacother. 2006 Nov; 60(9):633-8; Ørom et al., Gene. 2006 May 10; 372():137-41). Representative United States patents that teach the preparation of such hybrid structures comprise, but are not limited to, US patent nos. 5,013,830; 5,149,797; 5,220,007; 5,256,775; 5,366,878; 5,403,711; 5,491,133; 5,565,350; 5,623,065; 5,652,355; 5,652,356; and 5,700,922, each of which is herein incorporated by reference. In some embodiments, the inhibitory nucleic acid comprises at least one nucleotide modified at the 2' position of the sugar, most preferably a 2'-O-alkyl, 2'-O-alkyl-O-alkyl or 2'- fluoro-modified nucleotide. In other preferred embodiments, RNA modifications include 2'- fluoro, 2'-amino and 2' O-methyl modifications on the ribose of pyrimidines, abasic residues or an inverted base at the 3' end of the RNA. Such modifications are routinely incorporated into oligonucleotides and these oligonucleotides have been shown to have a higher Tm (i.e., higher target binding affinity) than; 2'-deoxyoligonucleotides against a given target. A number of nucleotide and nucleoside modifications have been shown to make the oligonucleotide into which they are incorporated more resistant to nuclease digestion than the native oligodeoxynucleotide; these modified oligos survive intact for a longer time than unmodified oligonucleotides. Specific examples of modified oligonucleotides include those comprising modified backbones, for example, phosphorothioates, phosphotriesters, methyl phosphonates, short chain alkyl or cycloalkyl intersugar linkages or short chain heteroatomic or heterocyclic intersugar linkages. Most preferred are oligonucleotides with phosphorothioate backbones and those with heteroatom backbones, particularly CH2 -NH-O-CH2, CH,-N(CH3)- O-CH2 (known as a methylene(methylimino) or MMI backbone], CH2 --O--N (CH3)-CH2, CH2 -N (CH3)-N (CH3)-CH2 and O-N (CH3)- CH2 -CH2 backbones, wherein the native phosphodiester backbone is represented as O- P-- O- CH,); amide backbones (see De
Mesmaeker et al. Ace. Chem. Res. 1995, 28:366-374); morpholino backbone structures (see Summerton and Weller, U.S. Pat. No. 5,034,506); peptide nucleic acid (PNA) backbone (wherein the phosphodiester backbone of the oligonucleotide is replaced with a polyamide backbone, the nucleotides being bound directly or indirectly to the aza nitrogen atoms of the polyamide backbone, see Nielsen et al., Science 1991, 254, 1497). Phosphorus-containing linkages include, but are not limited to, phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, methyl and other alkyl phosphonates comprising 3'alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates comprising 3'-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates having normal 3'-5' linkages, 2'-5' linked analogs of these, and those having inverted polarity wherein the adjacent pairs of nucleoside units are linked 3'-5' to 5'-3' or 2'-5' to 5'-2'; see US patent nos.3,687,808; 4,469,863; 4,476,301; 5,023,243; 5, 177,196; 5,188,897; 5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455, 233; 5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563, 253; 5,571,799; 5,587,361; and 5,625,050. Morpholino-based oligomeric compounds are described in Dwaine A. Braasch and David R. Corey, Biochemistry, 2002, 41(14), 4503-4510); Genesis, volume 30, issue 3, 2001; Heasman, J., Dev. Biol., 2002, 243, 209-214; Nasevicius et al., Nat. Genet., 2000, 26, 216-220; Lacerra et al., Proc. Natl. Acad. Sci., 2000, 97, 9591-9596; and U.S. Pat. No.5,034,506, issued Jul.23, 1991. Modified oligonucleotide backbones that do not include a phosphorus atom therein have backbones that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. These comprise those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH2 component parts; see US patent nos. 5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264, 562; 5, 264,564; 5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596, 086; 5,602,240; 5,610,289; 5,602,240; 5,608,046;
5,610,289; 5,618,704; 5,623, 070; 5,663,312; 5,633,360; 5,677,437; and 5,677,439, each of which is herein incorporated by reference. One or more substituted sugar moieties can also be included, e.g., one of the following at the 2' position: OH, SH, SCH3, F, OCN, OCH3 OCH3, OCH3 O(CH2)n CH3, O(CH2)n NH2 or O(CH2)nCH3 where n is from 1 to about 10; Ci to C10 lower alkyl, alkoxyalkoxy, substituted lower alkyl, alkaryl or aralkyl; Cl; Br; CN; CF3 ; OCF3; O-, S , or N-alkyl; O-, S-, or N-alkenyl; SOCH3; SO2 CH3; ONO2; NO2; N3; NH2; heterocycloalkyl; heterocycloalkaryl; aminoalkylamino; polyalkylamino; substituted silyl; an RNA cleaving group; a reporter group; an intercalator; a group for improving the pharmacokinetic properties of an oligonucleotide; or a group for improving the pharmacodynamic properties of an oligonucleotide and other substituents having similar properties. A preferred modification includes 2'-methoxyethoxy [2'- O-CH2CH2OCH3, also known as 2'-O-(2-methoxyethyl)] (Martin et al, HeIv. Chim. Acta, 1995, 78, 486). Other preferred modifications include 2'-methoxy (2'-O-CH3), 2'-propoxy (2'-OCH2 CH2CH3) and 2'-fluoro (2'-F). Similar modifications may also be made at other positions on the oligonucleotide, particularly the 3' position of the sugar on the 3' terminal nucleotide and the 5' position of 5' terminal nucleotide. Oligonucleotides may also have sugar mimetics such as cyclobutyls in place of the pentofuranosyl group. Inhibitory nucleic acids can also include, additionally or alternatively, nucleobase (often referred to in the art simply as “base”) modifications or substitutions. As used herein, “unmodified” or “natural” nucleobases include adenine (A), guanine (G), thymine (T), cytosine (C) and uracil (U). Modified nucleobases include nucleobases found only infrequently or transiently in natural nucleic acids, e.g., hypoxanthine, 6-methyladenine, 5-Me pyrimidines, particularly 5-methylcytosine (also referred to as 5-methyl-2' deoxycytosine and often referred to in the art as 5-Me-C), 5-hydroxymethylcytosine (HMC), glycosyl HMC and gentobiosyl HMC, as well as synthetic nucleobases, e.g., 2-aminoadenine, 2- (methylamino)adenine, 2- (imidazolylalkyl)adenine, 2-(aminoalklyamino)adenine or other heterosubstituted alkyladenines, 2-thiouracil, 2-thiothymine, 5-bromouracil, 5- hydroxymethyluracil, 8- azaguanine, 7-deazaguanine, N6 (6-aminohexyl)adenine and 2,6- diaminopurine. Kornberg, A., DNA Replication, W. H. Freeman & Co., San Francisco, 1980, pp75-77; Gebeyehu, G., et al. Nucl. Acids Res.1987, 15:4513). A “universal” base known in the art, e.g., inosine, can also be included.5-Me-C substitutions have been shown to increase nucleic acid duplex stability by 0.6-1.2<0>C. (Sanghvi, Y. S., in Crooke, S. T. and Lebleu, B., eds., Antisense Research and Applications, CRC Press, Boca Raton, 1993, pp. 276-278) and are presently preferred base substitutions.
It is not necessary for all positions in a given oligonucleotide to be uniformly modified, and in fact more than one of the aforementioned modifications may be incorporated in a single oligonucleotide or even at within a single nucleoside within an oligonucleotide. In some embodiments, both a sugar and an internucleoside linkage, i.e., the backbone, of the nucleotide units are replaced with novel groups. The base units are maintained for hybridization with an appropriate nucleic acid target compound. One such oligomeric compound, an oligonucleotide mimetic that has been shown to have excellent hybridization properties, is referred to as a peptide nucleic acid (PNA). In PNA compounds, the sugar- backbone of an oligonucleotide is replaced with an amide containing backbone, for example, an aminoethylglycine backbone. The nucleobases are retained and are bound directly or indirectly to aza nitrogen atoms of the amide portion of the backbone. Representative United States patents that teach the preparation of PNA compounds comprise, but are not limited to, US patent nos. 5,539,082; 5,714,331; and 5,719,262, each of which is herein incorporated by reference. Further teaching of PNA compounds can be found in Nielsen et al, Science, 1991, 254, 1497-1500. Inhibitory nucleic acids can also include one or more nucleobase (often referred to in the art simply as “base”) modifications or substitutions. As used herein, “unmodified” or “natural” nucleobases comprise the purine bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) and uracil (U). Modified nucleobases comprise other synthetic and natural nucleobases such as 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudo-uracil), 4-thiouracil, 8-halo, 8-amino, 8- thiol, 8- thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5- bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7- methylquanine and 7-methyladenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7- deazaadenine and 3- deazaguanine and 3-deazaadenine. In some embodiments, the inhibitory nucleic acids are chemically linked to one or more moieties or conjugates that enhance the activity, cellular distribution, or cellular uptake of the oligonucleotide. Such moieties comprise but are not limited to, lipid moieties such as a cholesterol moiety, cholic acid, a thioether, e.g., hexyl-S- tritylthiol, a thiocholesterol, an aliphatic chain, e.g., dodecandiol or undecyl residues, a phospholipid, e.g., di-hexadecyl-rac- glycerol or triethylammonium 1 ,2-di-O-hexadecyl- rac-glycero-3-H-phosphonate, a
polyamine or a polyethylene glycol chain, a palmityl moiety, or an octadecylamine or hexylamino-carbonyl-t oxycholesterol moiety. These moieties or conjugates can include conjugate groups covalently bound to functional groups such as primary or secondary hydroxyl groups. Conjugate groups of the invention include intercalators, reporter molecules, polyamines, polyamides, polyethylene glycols, polyethers, groups that enhance the pharmacodynamic properties of oligomers, and groups that enhance the pharmacokinetic properties of oligomers. Typical conjugate groups include cholesterols, lipids, phospholipids, biotin, phenazine, folate, phenanthridine, anthraquinone, acridine, fluoresceins, rhodamines, coumarins, and dyes. Groups that enhance the pharmacodynamic properties, in the context of this invention, include groups that improve uptake, enhance resistance to degradation, and/or strengthen sequence-specific hybridization with the target nucleic acid. Groups that enhance the pharmacokinetic properties, in the context of this invention, include groups that improve uptake, distribution, metabolism or excretion of the compounds of the present invention. The inhibitory nucleic acids useful in the present compositions and methods are sufficiently complementary to all or part of a target viral RNA, i.e., hybridize sufficiently well and with sufficient specificity, to give the desired therapeutic effect. “Complementary” refers to the capacity for pairing, through hydrogen bonding, between two sequences comprising naturally or non-naturally occurring bases or analogs thereof. For example, if a base at one position of an inhibitory nucleic acid is capable of hydrogen bonding with a base at the corresponding position of a target viral RNA, then the bases are considered to be complementary to each other at that position.100% complementarity is not required. In the context of the present compositions and methods, hybridization means hydrogen bonding, which may be Watson-Crick, Hoogsteen or reversed Hoogsteen hydrogen bonding, between complementary nucleoside or nucleotide bases. For example, adenine and thymine are complementary nucleobases which pair through the formation of hydrogen bonds. Complementary, as used herein, refers to the capacity for precise pairing between two nucleotides. The inhibitory nucleic acids and the target viral RNA are complementary to each other when a sufficient number of corresponding positions in each molecule are occupied by nucleotides that can hydrogen bond with each other. Thus, “specifically hybridizable” and “complementary” are terms which are used to indicate a sufficient degree of complementarity or precise pairing such that stable and specific binding occurs between the inhibitory nucleic acid and the target viral RNA. For example, if a base at one position of an inhibitory nucleic
acid is capable of hydrogen bonding with a base at the corresponding position of the target viral RNA, then the bases are considered to be complementary to each other at that position. Although in some embodiments, 100% complementarity is desirable, it is understood in the art that a complementary nucleic acid sequence need not be 100% complementary to that of its target nucleic acid to be specifically hybridisable. A complementary nucleic acid sequence for purposes of the present compositions and methods is specifically hybridisable when binding of the sequence to the target viral RNA interferes with the abundance, splicing, post-transcriptional processing, or translation of the mRNA transcript, and there is a sufficient degree of complementarity to avoid non-specific binding of the sequence to non-target nucleic acid molecules under conditions in which specific binding is desired, e.g., under physiological conditions. In general, the inhibitory nucleic acids useful in the compositions and methods described herein have at least 80% sequence complementarity to a target region within the target nucleic acid, e.g., 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% sequence complementarity to the target region. For example, an antisense compound in which 18 of 20 nucleobases of the antisense oligonucleotide are complementary, and would therefore specifically hybridize, to a target region would represent 90 percent complementarity. Percent complementarity of an inhibitory nucleic acid with a region of a target nucleic acid can be determined routinely using basic local alignment search tools (BLAST programs) (Altschul et al., J. Mol. Biol., 1990, 215, 403-410; Zhang and Madden, Genome Res., 1997, 7, 649-656). In general the inhibitory nucleic acids must retain specificity for their target, i.e., must not directly bind to, or directly significantly affect expression levels of, transcripts other than the intended target viral RNA. In some embodiments, the inhibitory nucleic acid is capable of reducing viral infection in a human cell (e.g., HEK293 cell, an iPSC cell, BEAS-2B cell, or A549-ACE2 cell). In some embodiments, the viral infection is reduced by at least about 30%, by at least about 35%, by at least about 40%, by at least about 45%, by at least about 50%, by at least about 55%, by at least about 60%, by at least about 65%, by at least about 70%, by at least about 75%, by at least about 80%, by at least about 85%, by at least about 90%, by at least about 95%, or about 100% compared to a human cell that is not exposed to the inhibitory nucleotide. In some embodiments, the pharmaceutical composition is an inhibitory nucleic acids comprising a sequence selected from SEQ ID NOs: 1-40 (Table 1). In some embodiments, the inhibitory nucleic acid includes a nucleic acid complementary to a sequence selected from SEQ ID NOs: 1-40. In some embodiments, the inhibitory nucleic acid includes a double stranded
inhibitory nucleic acid comprising at least one sequence selected from SEQ ID NO: 1-40. In some embodiments, an inhibitory nucleic acid includes a sequence complement to at least one sequence selected from SEQ ID NO: 1-40. In some embodiments, the pharmaceutical composition is an inhibitory nucleotide comprising any of SEQ ID NOs: 1-40 (and/or its complement) wherein all or some of the nucleotides have been modified in one or more ways as described above, e.g., with a phosphorothioate backbone, one or more 2'-methoxyethoxy modifications, one or more 2'-methoxy modifications, one or more 2'-propoxy modifications, or one or more 2'-fluoro modifications. In some embodiments, the pharmaceutical composition comprises any of SEQ ID NOs: 1-40, and/or its complement. In some embodiments the pharmaceutical composition comprising any of SEQ ID NOs: 1-40, and its complement, is therapeutically effective for treating a disease caused by SARS-CoV2. [Table 1]
#where rN is a ribonucleoside for rN ={rA,rU,rC,rG}, and N without any r prepended is a deoxyribonucleoside for N={A,T,C,G} Pharmaceutical Compositions The methods described herein can include the administration of pharmaceutical compositions and formulations comprising inhibitory nucleic acid sequences designed to target SARS-CoV2 RNA or host RNA that interacts with SARS-CoV2 proteins. In some embodiments, the compositions are formulated with a pharmaceutically acceptable carrier. The pharmaceutical compositions and formulations can be administered parenterally, topically, orally or by local administration, such as by aerosol or transdermally. The pharmaceutical compositions can be formulated in any way and can be administered in a variety of unit dosage forms depending upon the condition or disease and the degree of illness, the general medical condition of each patient, the resulting preferred method of administration and the like. Details on techniques for formulation and administration of pharmaceuticals are well described in the scientific and patent literature, see, e.g., Remington: The Science and Practice of Pharmacy, 21st ed., 2005. The inhibitory nucleic acids can be administered alone or as a component of a pharmaceutical formulation (composition). The compounds may be formulated for administration, in any convenient way for use in human or veterinary medicine. The compositions may conveniently be presented in unit dosage form and may be prepared by any methods well known in the art of pharmacy. The amount of active ingredient (e.g., nucleic acid sequences of this invention) which can be combined with a carrier material to produce a single dosage form can vary depending upon the host being treated, the particular mode of administration. The amount of active ingredient which can be combined with a carrier material to produce a single dosage form will generally be that amount of the compound which produces a therapeutic effect, e.g., reduction in SARS-CoV2 infection levels and/or reduction in a symptom of COVID19.
Pharmaceutical compositions described herein can be prepared according to any method known to the art for the manufacture of pharmaceuticals. Such compositions can contain, for example, preserving agents. A composition can be admixtured with nontoxic pharmaceutically acceptable excipients which are suitable for manufacture. Compositions may comprise one or more diluents, emulsifiers, preservatives, buffers, excipients, etc. and may be provided in such forms as liquids, powders, emulsions, lyophilized powders, controlled release formulations, on patches, in implants, etc. Wetting agents, emulsifiers, and lubricants, such as sodium lauryl sulfate and magnesium stearate, as well as coloring agents, release agents, coating agents, sweetening, flavoring and perfuming agents, preservatives and antioxidants can also be present in the compositions. Aqueous suspensions can contain an active agent (e.g., nucleic acid sequences of the invention) in admixture with excipients suitable for the manufacture of aqueous suspensions, e.g., for aqueous intradermal injections. Such excipients include a suspending agent, such as sodium carboxymethylcellulose, methylcellulose, hydroxypropylmethylcellulose, sodium alginate, polyvinylpyrrolidone, gum tragacanth and gum acacia, and dispersing or wetting agents such as a naturally occurring phosphatide (e.g., lecithin), a condensation product of an alkylene oxide with a fatty acid (e.g., polyoxyethylene stearate), a condensation product of ethylene oxide with a long chain aliphatic alcohol (e.g., heptadecaethylene oxycetanol), a condensation product of ethylene oxide with a partial ester derived from a fatty acid and a hexitol (e.g., polyoxyethylene sorbitol mono-oleate), or a condensation product of ethylene oxide with a partial ester derived from fatty acid and a hexitol anhydride (e.g., polyoxyethylene sorbitan mono-oleate). The aqueous suspension can also contain one or more preservatives such as ethyl or n-propyl p-hydroxybenzoate, one or more coloring agents, one or more flavoring agents and one or more sweetening agents, such as sucrose, aspartame or saccharin. Formulations can be adjusted for osmolarity. In some embodiments, oil-based pharmaceuticals are used for administration of nucleic acid sequences as described herein. As an example of an injectable oil vehicle, see Minto (1997) J. Pharmacol. Exp. Ther.281:93-102. Pharmaceutical compositions can also be in the form of oil-in-water emulsions. The oily phase can be a vegetable oil or a mineral oil, described above, or a mixture of these. Suitable emulsifying agents include naturally-occurring gums, such as gum acacia and gum tragacanth, naturally occurring phosphatides, such as soybean lecithin, esters or partial esters derived from fatty acids and hexitol anhydrides, such as sorbitan mono-oleate, and condensation products of these partial esters with ethylene oxide, such as polyoxyethylene
sorbitan mono-oleate. The emulsion can also contain sweetening agents and flavoring agents, as in the formulation of syrups and elixirs. Such formulations can also contain a demulcent, a preservative, or a coloring agent. In alternative embodiments, these injectable oil-in-water emulsions of the invention comprise a paraffin oil, a sorbitan monooleate, an ethoxylated sorbitan monooleate and/or an ethoxylated sorbitan trioleate. In some embodiments, the pharmaceutical compositions can also be delivered as microspheres for slow release in the body. For example, microspheres can be administered via intradermal injection of drug which slowly release subcutaneously; see Rao (1995) J. Biomater Sci. Polym. Ed. 7:623-645; as biodegradable and injectable gel formulations, see, e.g., Gao (1995) Pharm. Res. 12:857-863 (1995); or, as microspheres for oral administration, see, e.g., Eyles (1997) J. Pharm. Pharmacol.49:669-674. In some embodiments, the pharmaceutical compositions can be parenterally administered, such as by intravenous (IV) administration or administration into a body cavity or lumen of an organ. These formulations can comprise a solution of active agent dissolved in a pharmaceutically acceptable carrier. Acceptable vehicles and solvents that can be employed are water and Ringer's solution, an isotonic sodium chloride. In addition, sterile fixed oils can be employed as a solvent or suspending medium. For this purpose any bland fixed oil can be employed including synthetic mono- or diglycerides. In addition, fatty acids such as oleic acid can likewise be used in the preparation of injectables. These solutions are sterile and generally free of undesirable matter. These formulations may be sterilized by conventional, well known sterilization techniques. The formulations may contain pharmaceutically acceptable auxiliary substances as required to approximate physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, sodium lactate and the like. The concentration of active agent in these formulations can vary widely, and will be selected primarily based on fluid volumes, viscosities, body weight, and the like, in accordance with the particular mode of administration selected and the patient's needs. For IV administration, the formulation can be a sterile injectable preparation, such as a sterile injectable aqueous or oleaginous suspension. This suspension can be formulated using those suitable dispersing or wetting agents and suspending agents. The sterile injectable preparation can also be a suspension in a nontoxic parenterally- acceptable diluent or solvent, such as a solution of 1,3-butanediol. The administration can be by bolus or continuous infusion (e.g., substantially uninterrupted introduction into a blood vessel for a specified period of time).
In some embodiments, the pharmaceutical compounds and formulations can be lyophilized. Stable lyophilized formulations comprising an inhibitory nucleic acid can be made by lyophilizing a solution comprising a pharmaceutical of the invention and a bulking agent, e.g., mannitol, trehalose, raffinose, and sucrose or mixtures thereof. A process for preparing a stable lyophilized formulation can include lyophilizing a solution about 2.5 mg/mL protein, about 15 mg/mL sucrose, about 19 mg/mL NaCl, and a sodium citrate buffer having a pH greater than 5.5 but less than 6.5. See, e.g., U.S.20040028670. The compositions and formulations can be delivered by the use of liposomes. By using liposomes, particularly where the liposome surface carries ligands specific for target cells, or are otherwise preferentially directed to a specific organ, one can focus the delivery of the active agent into target cells in vivo. See, e.g., U.S. Patent Nos.6,063,400; 6,007,839; Al-Muhammed (1996) J. Microencapsul.13:293-306; Chonn (1995) Curr. Opin. Biotechnol.6:698-708; Ostro (1989) Am. J. Hosp. Pharm. 46:1576-1587. As used in the present invention, the term “liposome” means a vesicle composed of amphiphilic lipids arranged in a bilayer or bilayers. Liposomes are unilamellar or multilamellar vesicles that have a membrane formed from a lipophilic material and an aqueous interior that contains the composition to be delivered. Cationic liposomes are positively charged liposomes that are believed to interact with negatively charged DNA molecules to form a stable complex. Liposomes that are pH-sensitive or negatively-charged are believed to entrap DNA rather than complex with it. Both cationic and noncationic liposomes have been used to deliver DNA to cells. Liposomes can also include “sterically stabilized” liposomes, i.e., liposomes comprising one or more specialized lipids. When incorporated into liposomes, these specialized lipids result in liposomes with enhanced circulation lifetimes relative to liposomes lacking such specialized lipids. Examples of sterically stabilized liposomes are those in which part of the vesicle-forming lipid portion of the liposome comprises one or more glycolipids or is derivatized with one or more hydrophilic polymers, such as a polyethylene glycol (PEG) moiety. Liposomes and their uses are further described in U.S. Pat. No.6,287,860. Compositions disclosed herein can be administered for prophylactic and/or therapeutic treatments. In some embodiments, for therapeutic applications, compositions are administered to a subject who is infected or at risk of infection with SARS-CoV2, in an amount sufficient to cure, alleviate or partially arrest the clinical manifestations of the disorder or its complications; this can be called a therapeutically effective amount. For example, in some embodiments, pharmaceutical compositions of the invention are administered in an amount sufficient to decrease the number of lung cells infected with SARS-CoV2.
The amount of pharmaceutical composition adequate to accomplish this is a therapeutically effective dose. The dosage schedule and amounts effective for this use, i.e., the dosing regimen, will depend upon a variety of factors, including the stage of the disease or condition, the severity of the disease or condition, the general state of the patient's health, the patient’s physical status, age, and the like. In calculating the dosage regimen for a patient, the mode of administration also is taken into consideration. The dosage regimen also takes into consideration pharmacokinetics parameters well known in the art, i.e., the active agents’ rate of absorption, bioavailability, metabolism, clearance, and the like (see, e.g., Hidalgo-Aragones (1996) J. Steroid Biochem. Mol. Biol. 58:611-617; Groning (1996) Pharmazie 51:337-341; Fotherby (1996) Contraception 54:59-69; Johnson (1995) J. Pharm. Sci.84:1144-1146; Rohatagi (1995) Pharmazie 50:610-613; Brophy (1983) Eur. J. Clin. Pharmacol. 24:103-108; Remington: The Science and Practice of Pharmacy, 21st ed., 2005). The state of the art allows the clinician to determine the dosage regimen for each individual patient, active agent and disease or condition treated. Guidelines provided for similar compositions used as pharmaceuticals can be used as guidance to determine the dosage regimen, i.e., dose schedule and dosage levels, administered practicing the methods of the disclosure are correct and appropriate. Single or multiple administrations of compositions can be given depending on for example: the dosage and frequency as required and tolerated by the patient, the degree and amount of therapeutic effect generated after each administration (e.g., effect on symptoms), and the like. The compositions should provide a sufficient quantity of active agent to effectively treat, prevent or ameliorate conditions, diseases or symptoms. The inhibitory nucleic acids used to practice the methods described herein, can be isolated from a variety of sources, genetically engineered, amplified, and/or expressed/ generated recombinantly. Recombinant nucleic acid sequences can be individually isolated or cloned and tested for a desired activity. Any recombinant expression system can be used, including e.g. in vitro, bacterial, fungal, mammalian, yeast, insect, or plant cell expression systems. Inhibitory nucleic acids of the invention can be inserted into delivery vectors and expressed from transcription units within the vectors. The recombinant vectors can be DNA plasmids or viral vectors. Generation of the vector construct can be accomplished using any suitable genetic engineering techniques well known in the art, including, without limitation, the standard techniques of PCR, oligonucleotide synthesis, restriction endonuclease digestion, ligation, transformation, plasmid purification, and DNA sequencing, for example as described
in Sambrook et al. Molecular Cloning: A Laboratory Manual. (1989)), Coffin et al. (Retroviruses. (1997)) and “RNA Viruses: A Practical Approach” (Alan J. Cann, Ed., Oxford University Press, (2000)). As will be apparent to one of ordinary skill in the art, a variety of suitable vectors are available for transferring nucleic acids of the invention into cells. The selection of an appropriate vector to deliver nucleic acids and optimization of the conditions for insertion of the selected expression vector into the cell, are within the scope of one of ordinary skill in the art without the need for undue experimentation. Viral vectors comprise a nucleotide sequence having sequences for the production of recombinant virus in a packaging cell. Viral vectors expressing nucleic acids of the invention can be constructed based on viral backbones including, but not limited to, a retrovirus, lentivirus, adenovirus, adeno-associated virus, pox virus, or alphavirus. The recombinant vectors capable of expressing the nucleic acids of the invention can be delivered as described herein, and persist in target cells (e.g., stable transformants). Methods of Treating Coronavirus Disease 2019 (COVID-19) Coronavirus disease (COVID-19) is caused by the novel coronavirus SARS-CoV-2. The novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2, and/or any variant of SARS-CoV2) is a positive-sense single-stranded (+ss) RNA virus, wherein the viral genome encodes 29 proteins, which include the four structural proteins, membrane or matrix (M), nucleocapsid (N), envelope (E), and spike (S) proteins. In addition, there are 16 non-structural proteins, NSP1-16, and 9 accessory proteins ORF3a-ORF10, though the expression of some of the accessory factors are still debated. Provided herein are methods of treating Coronavirus disease 2019 (COVID-19) in a subject that include administering to the subject a therapeutically effective amount of a composition comprising an exogenous nucleic acid (e.g., an inhibitory nucleic acid) and delivering the exogenous nucleic acid into a cell, wherein the exogenous nucleic acid binds to a target RNA and modulates gene expression of the target RNA, thereby treating Coronavirus disease 2019 (COVID-19) in the subject. As used herein, the term “exogenous nucleic acid” refers to any DNA or RNA, or a combination thereof, in either a single- or double-stranded form that is introduced from or originating from outside a cell, a tissue or an organism that is not produced by or does not originate from the same cell, tissue, or organism in which it is being introduced. In some embodiments, the exogenous nucleic acid is a single stranded nucleic acid. In some
embodiments, the exogenous nucleic acid is a double stranded nucleic acid. In some embodiments the exogenous nucleic acid is an inhibitory nucleic acid. In some embodiments, the exogenous nucleic acid (e.g., inhibitory nucleic acid) is one of the inhibitory nucleic acids described herein. In some embodiments, the inhibitory nucleotides disclosed herein modulate gene expression within a cell. As used herein, “modulating” can refer to modifying, regulating, or altering the endogenous gene expression in a cell. In some embodiments, modulating gene expression can include systematically influencing RNA stability and/or translation by activating or suppressing the gene expression. In some embodiments, modulation of gene expression can include stabilizing a target RNA. In some embodiments, stabilizing a target RNA can increase translation of the target RNA. In some embodiments, modulation of gene expression can include destabilizing a target RNA. In some embodiments, destabilizing a target RNA can suppress translation of the target RNA. In some embodiments, modulation of gene expression can include increasing translation of a target RNA. In some embodiments, modulation of gene expression can include suppressing translation of a target RNA. In some embodiments, the gene expression of the target RNA is upregulated. In some embodiments, the gene expression of the target RNA is downregulated. In some embodiments, the modulating gene expression comprises knockdown of a gene expressed from the target RNA. In some embodiments, the target RNA interacts with a viral protein. In some embodiments, the target RNA interacting with the viral protein increases protein expression from the gene expressed from the target RNA. In some embodiments, the target RNA interacting with the viral protein inhibits protein expression from the gene expressed from the target RNA. In some embodiments, the modulating comprises blocking the target RNA from interacting with the viral protein. In some embodiments, the modulating comprises blocking the target RNA from interacting with the viral protein wherein the exogenous nucleic acid binds to the target RNA. In some embodiments, the modulating comprises blocking the target RNA from interacting with the viral protein wherein the exogenous nucleic acid binds to the target RNA by targeting the structured genomic RNA that interacts with the viral protein. In some embodiments, an endogenous gene is expressed from the target RNA. In some embodiments, the gene expressed from the target RNA is selected from the group consisting of RPN1, UGGT1, OST4, NDUFA4, HSPA5, PSMD13, LAPTMFA, LAMP1, C1QBP, and LDHB.
In some embodiments, the viral protein is a SARS-CoV-2 protein (e.g., SARS-CoV2 and any variant thereof). In some embodiments, the SARS-CoV-2 protein binds to the 5’ UTR, the coding sequence, or the 3’ UTR region of the target RNA. In some embodiments, the SARS-CoV-2 protein binds to the 5’ UTR region of the target RNA. In some embodiments, the SARS-CoV-2 protein binds to the coding sequence region of the target RNA. In some embodiments, the SARS-CoV-2 protein binds to the 3’ UTR region of the target RNA. In some embodiments, the delivering of an exogenous nucleic acid into a cell comprises transfection of the exogenous nucleic acid into the cell. In some embodiments, the delivering comprises a virus-based delivery of the exogenous nucleic acid into the cell. In some embodiments, the virus-based delivery comprises a lentivirus. Coronavirus-derived protein As used herein, the term “viral protein” refers to a protein generated by a virus. A viral protein can be both a component and a product of a virus. Viral proteins can be grouped according to their functions, wherein groups of the viral proteins can include structural proteins, nonstructural proteins, regulatory proteins, and accessory proteins. Coronaviruses contain a positive-sense, single-stranded RNA genome, and the viral genome consists of more than 29,000 bases and encodes 29 proteins. SARS-CoV-2 has four structural proteins: the E and M proteins, which form the viral envelope; the N protein, which binds to the virus’s RNA genome; and the S protein, which binds to human receptors. As used herein, a “coronavirus-derived protein” or “SARS-CoV-2 protein” can refer to a protein that is encoded from the coronavirus viral genome. In some embodiments, the SARS-CoV-2 protein can be a structural protein. In some embodiments, the SARS-CoV-2 protein can be a non- structural protein (NSP). In some embodiments, the SARS-CoV-2 protein can comprise a NSP1, a NSP2, a NSP3, a NSP4, a NSP5, a NSP6, a NSP7, a NSP8, a NSP9, a NSP10, a NSP11, a NSP12, a NSP13, a NSP14, a NSP15, or a NSP16 protein. In some embodiments, the SARS- CoV-2 protein can be an accessory protein. In some embodiments, the SARS-CoV-2 protein can comprise a ORF3a, a ORF3b, a ORF6, a ORF7a, a ORF7b, a ORF8, a ORF9b, a ORF9c, or a ORF10 protein. In some embodiments, the SARS-CoV-2 protein comprises a NSP1, a NSP2, a NSP3, a NSP4, a NSP5, a NSP6, a NSP7, a NSP8, a NSP9, a NSP10, a NSP11, a NSP12, a NSP13, a NSP14, a NSP15, a NSP16, a ORF3a, a ORF3b, a ORF6, a ORF7a, a ORF7b, a ORF8, a ORF9b, a ORF9c, or a ORF10 protein. In some embodiments, the SARS- CoV-2 protein is selected from the group consisting of NSP2, NSP3, NSP6, NSP12, NSP14, ORF3b, ORF7b, and ORF9c.
In some embodiments, NSP2, NSP3, NSP6, NSP12, NSP14, ORF3b, ORF7b, and ORF9c interact with host RNA. In some embodiments, inhibitory nucleic acids block the interaction between a coronavirus-derived protein and a host RNA. In some embodiments, the inhibitory nucleic acid modulates gene expression of a host RNA. EXAMPLES Example 1 - eCLIP elucidates SARS-CoV-2 protein-viral RNA interactions in virus infected cells To investigate the RNA interactome of SARS-CoV-2 proteins, eCLIP on SARSCoV-2 infected African Green Monkey kidney (Vero E6) cells was performed, which are an efficiently infected cell line (FIG. 1A). Infected cells were subject to UV irradiation, which covalently crosslinked interacting proteins to RNAs. This was followed by immunoprecipitation of non- structural proteins NSP8 and NSP12, which form part of the replication transcription complex (RTC), and N (nucleocapsid), using protein-specific antibodies to isolate the bound RNA. The RNA-bound proteins were resolved via SDS-PAGE and transferred to nitrocellulose membranes such that only the region spanning the expected protein size and 75 kDa larger were excised and purified in subsequent steps. The same size region of a non- immunoprecipitated input whole cell lysate was included as size-matched input to identify and remove non-specific, enriched sequences. RNA was converted to DNA libraries, sequenced to an average depth of ~25 million reads, and mapped to the SARS-CoV-2 viral genome and African Green Monkey genome to determine SARS-CoV-2 protein RNA interactions. Thus, reads from the immunoprecipitation (IP) samples correspond to RNA crosslinked to the IP enriched proteins NSP8, NSP12, and N, while reads from the input (IN) samples correspond to RNA crosslinked to RBPs at a similar size to the IP protein in the cell milieu. Normalizing read density of IP to IN samples provide a measure of protein-specific RNA interaction. The eCLIP results provide a genome-wide map of RNA interactions with viral proteins during an authentic SARS-CoV-2 infection. Reads were mapped across the entire genome in the positive sense for both IP and IN samples for all proteins, with greater than 96% coverage in all input and IP samples. The near complete coverage implies that most of the viral RNA interacts with RNA binding proteins. Two biological replicates were performed for each protein, and read densities show strong replicate agreement in all samples (Pearson’s
coefficient ≥ 0.87). Input-normalized IP read densities also display strong replicate correlation (Pearson’s coefficient ≥ 0.88). To identify positions with particularly enriched RBP binding, relative positional enrichment (ΔΔReadDensity) was computed by dividing the fold change of read density of IP over IN at each position by the global median fold change for that sample (FIG.1B). A strong relative positional enrichment in NSP8 and NSP12 eCLIP read density was observed at the 5’ end, at 573-fold and 103-fold at position 1 respectively, but only 0.75-fold for N. High relative positional enrichment was also observed for the region before position 66, which marks the start of the leader transcription regulatory site (TRS), with >22-fold for NSP12, >4.9-fold for NSP8, and only >0.6-fold for N (Table 2). In both NSP12 and NSP8, there appeared to be a drop in enrichment earlier, around position 33. This corresponds to the end of Stem Loop 1 (SL1) in the 5’ untranslated region (UTR), potentially implicating SL1 in recruiting, stabilizing or otherwise regulating the replicase proteins. At the 3’ end, NSP12 and NSP8 appear strongly enriched (>5-fold relative to global median) after the stop codon in N at position 29,533 up to the start of the S2M structured region in the 3’ UTR at position 29,695. NSP12 and NSP8 continued to be strongly enriched again after the S2M region at position 29,809. The lack of enrichment at the S2M structure may suggest that its function is unrelated to recruiting replicase proteins NSP12 and NSP8. The eCLIP findings thus provide a map of the direct interaction between replicase proteins NSP8 and NSP12 with regions in the UTRs likely involved in regulating replication and transcription. [Table 2] SARS-CoV-2 genome regions with ^5-fold ΔΔReadDensity
t was o serve t at on y a sma ract on o rea s mappe to t e negat ve sense strand in the input samples (0.00075 for N, 0.00076 for NSP12, 0.0043 for NSP8) and the IP samples for N (0.00046). In contrast, NSP12 and NSP8 IP samples enriched the fraction of negative sense reads to 0.067 and 0.039, about 100- and 10-fold from IN samples, respectively. The negative sense strand coverage in IP samples was also lower in N (33%) than in NSP12 (80%) or NSP8 (58%). NSP12 and NSP8 IP reads are piled up in the 5’ and 3’ regions on the negative sense strand, which is similar to the positive sense strand. The low fraction of reads in the input samples prevents further quantitative assessments. Nevertheless, the findings confirmed the roles for NSP12 and NSP8, but not N, in transcribing negative sense RNA templates to generate mRNAs for translation, and the ability for N to selectively associate with positive sense genomic RNA over negative sense RNAs. Besides regions around the 5’ and 3’ end, several sharp peaks with high relative enrichment in read density were observed (>5-fold, for a contiguous region of 10 nt or more) i.e. 22 peaks for NSP12, 37 peaks for NSP8, and 7 peaks for N (Table 2). NSP12 eCLIP had especially strong peaks; five example major peaks are highlighted (FIG. 1B-1C) at regions 3533-3635, 7436-7526, 17202-17222, 21177-21206, and 24018-24079 (FIG.1B), which have maximum positional fold changes of 19, 14, 9.2, 11 and 13-fold (Table 2), respectively. Intriguingly, when these regions were lined up with SHAPE reactivity data, the peaks
corresponded to regions with low SHAPE Shannon entropy values, which represent regions that are rigid or structured (FIG. 1C). A closer inspection showed these stem-loop structures to have long, stable stems (FIG. 7A). As no strong sequence motifs were observed, it was hypothesized that structural elements in the SARS-CoV-2 genome likely facilitate protein- RNA interactions with NSP12. Recently, it was shown via RNA footprinting with SHAPE structure probing experiments that in addition to RNA-RNA base pairing, some nucleotides with low SHAPE reactivity may be due to direct hydrogen bond interactions with RNA binding proteins. For NSP12 and NSP8, a slight negative correlation between log2 fold change in eCLIP read density and SHAPE entropy was further observed (FIG. 1D), where the latter correlates inversely with structuredness. The eCLIP findings can thus add a layer of functional information – interaction with NSP12 – to structural elements in SARS-CoV-2. It was noticed that the peak at position 7436-7526 is uniquely enriched in NSP12 eCLIP only. Located near the 3’ end of the gene encoding for NSP3, this peak overlapped with a structured region of 3 consecutive stem loops at region 7412-7545 (FIG.1E). To validate the specific protein-RNA interaction, a filter binding assay of NSP12 was performed with in vitro transcribed RNA bearing the sequence in this region (FIG. 1F). A scrambled control was included with the same sequence composition but shuffled such that the structure is no longer preserved (Table 3). The peak RNA showed a binding dissociation constant KD of 17 nM, compared to a KD of 79 nM by the scrambled control (FIG.1G). The scrambled control likely represents the non-specific affinity of NSP12 for RNA (FIG.7B). The sequence and structure of the central hairpin, where the highest point of the peak fold change is located, were also highly conserved among related betacoronaviruses (FIGs. 7C-7D). It also appeared to be located ~500 nt downstream of the steplike reduction in RNA reads extending from the 5’ side of the genome (FIG.7E), though the functional linkage between the two features is unknown. [Table 3] Nucleic acid sequences used in filter binding assay
Finally, the relative eCLIP enrichment between NSP8, NSP12 and N was compared to investigate any similarities in the positive sense RNA interaction between the three proteins. When comparing the relative positional enrichment, N and NSP12 show no correlation (R2 of 0.01), whereas N and NSP8 are slightly correlated (R2 of 0.27; FIG.1H). As expected, NSP8 and NSP12 were the most highly correlated (R2 of 0.57), though there were still substantial differences, as 43% of the variation was unaccounted for by the other protein (FIG. 1H). However, when the data was transformed logarithmically, greater correlations among all three proteins were observed. As logarithmic transformations shed light on signals at lower values, the greater correlations among the log-transformed data of different proteins may imply a greater similarity of more transient protein-RNA interactions. This invites future inquiry about the importance of transient protein-RNA interactions in the life cycle of SARS-CoV-2. To evaluate the blocking of the NSP12 binding to SARS-CoV-2 RNA from position 7414-7555, or disruption of the structure, a locked nucleic acid (LNA) was designed to be complementary to the middle hairpin, specifically position 7472-7493 (FIG. 1I). LNA7472– 7493 (SEQ ID NO: 52 - +T+A+C+A+A+GT+T+G+AT+GA+A+T+TA+C+A+A+C+C) and scrambled LNA (SEQ ID NO: 53 - +G+C+GGC+ACG+TTG+CG+AGT+A+C+T) were synthesized by Integrated DNA Technologies. The LNA was predicted to hybridize with SARS-CoV-2 RNA from position 7414-7555 using the NUPACK software1 (FIG. 1J). Vero E6 cells transfected with LNA7472–7493 and a scrambled LNA were infected with SARS-
CoV-2 (WA1 isolate) and the infection rate was assessed by immunofluorescence 24 hours post infection. It was found that treating cells with LNA7472–7493 reduced the infection rate (as given by integrated intensity) by 64% (p<0.01) (FIGs. 1K-1L). This shows that targeting structured genomic RNA interacting with SARS-CoV-2 protein NSP12 may be a potential antiviral treatment strategy. For therapeutic development, modalities may not be limited to LNAs, but also to include siRNAs and small molecules. Example 2 - SARS-CoV-2 proteins interact with host RNAs in virus-infected cells As RNA virus infections have been shown to perturb host transcriptomes, such as via mRNA degradation, mRNA export inhibition, splicing interference, 5’ cap stealing, and other ways of host translation inhibition, evidence for direct interaction between viral proteins host cell RNAs can shed light on the mechanism and function as a result of these interactions. Therefore, the extent to which NSP8, NSP12 and N interacted with host RNAs was investigated. Targeted transcripts were determined by having one or more peaks that meet the stringent IDR (irreproducible discovery rate) threshold of overlapping peaks between two replicates for every protein, and satisfy statistical cutoffs of p<0.001, and more than 8-fold enrichment in the immunoprecipitated sample (IP) over the size-matched input sample. It was found that NSP8, NSP12 and N interact with 457, 703 and 24 genes with 658, 1457 and 39 significant peaks, respectively (FIG. 2A). Interestingly, the number of RNA reads in Transcripts Per Kilobase Million (TPM) from both NSP8 and NSP12 immunoprecipitation (IP) samples were mapped more frequently to host transcripts than viral RNA (FIG.2B). Among the target genes, NSP12 and NSP8 shared 128 genes in common (18% of NSP12 targets, 26% of NSP8 targets), implying that NSP12 and NSP8 may interact with different host genes in both their individual and complexed states (FIG. 2C). In contrast, a majority of N immunoprecipitated RNA reads were mapped to viral RNA, consistent with its role in enclosing the viral genome during virion assembly. The large number of peaks (2137 total) that map to the 1058 host genes further suggests a potential in perturbing host gene expression that may be required for viral replication. To determine if there are differences in expression levels of the host target genes whose mRNAs are enriched in NSP12 and NSP8 eCLIP, transcriptome-wide mRNA sequencing of SARS-CoV-2 infected Vero E6 cells was performed and the eCLIP target genes were mapped to differential expression levels. It was found that NSP12 and NSP8 target mRNA levels are significantly increased than non-target genes (p<10-5, KS test; FIG. 2D). To understand the processes enriched by the target genes, a Gene Ontology analysis was perfomed. 54 GO processes that are significantly enriched by NSP12 target genes were found (Padjusted<0.01)
(Table 4), whereas no significant GO processes are found in NSP8 target genes. Many of the GO processes fall into three broad categories related to regulating transcription and gene expression, cell cycle and apoptosis, and phosphorylation and signaling processes (FIG.2E). Of the transcription regulation genes, many have antiviral response properties (e.g. NF-κB, BATF, NR4A1, BMP2, SQSTM1, MAFF, MDM2), while others have demonstrated proviral activities, such as DDX5, SFPQ, FBXW11 and ATF-3. Of the genes regulating cell proliferation, cell cycle and apoptosis, PAK2 has been associated with anti-apoptotic signaling and promoting HIV survival, whereas many other genes have overlapping annotation as the transcription regulation genes. A recent study elucidated global phosphorylation changes in cellular proteins upon SARS-CoV-2 infection. Specifically, the p38/MAPK cascade activity is induced by viral infection, and treatment with p38 inhibitors has restrictive effects on viral proliferation. In the NSP12 eCLIP data, enrichment of the MAPK cascade and other signaling pathway genes was also shown (e.g. MAPK1, MAP2K1/3, MAP4K4/5, PIM3, PAK2, EPHA2). In the context of the whole virus infection where a multitude of viral proteins and host defense responses are at play, it cannot be definitively concluded that the interactions between NSP12 and these mRNAs have a causative or inhibitive relationship. Nevertheless, the correlation of NSP12 protein-RNA interactions with these pathway genes, which are relevant to viral infection and host response, leads to hypothesize a potential, albeit unknown, role, and the data represents a rich resource for subsequent mechanism studies. To understand the individual contribution of viral protein-host RNA interactions, the study proceeded to profile the protein- RNA interactions of each SARS-CoV-2 protein. [Table 4] Enriched Gene Ontology processes in NSP12 eCLIP in SARS-CoV-2 infected cells
Example 3 - Exogeneously expressed SARS-CoV-2 proteins interact with one third of the transcriptome in lung epithelial cells Even though NSP12, NSP8, and N eCLIPs were performed in virus infected vero cells, in order to further investigate whether SARS-COV-2 proteins directly interact with the human host transcriptome, eCLIP was performed on the 29 proteins encoded in the SARS-CoV-2 genome and one mutant (FIG.3A). Due to the lack of antibodies specific for most of the viral proteins, the individual proteins were exogenously expressed in a lung epithelial cell line BEAS-2B, which is an immortalized primary bronchial cell line representative of normal lung physiology. Each protein was either fused with a 2xStrep tag and expressed stably via lentiviral transduction or fused with a 3xFLAG tag and expressed transiently via transfection. Following UV crosslinking, the tagged proteins were immunoprecipitated using anti-FLAG or anti-Strep antibodies. Subsequent RNA purification and library purification steps were performed as in the viral eCLIP experiments. Cells expressing only the 3xFLAG or 2xStrep tags and wildtype cells were used as controls to remove background peaks in subsequent analysis steps. From the SARS-CoV-2 proteome-wide eCLIP results, SARS-CoV-2 proteins interacted with RNA represented by 4773 coding genes, which is about a third of the transcriptome of BEAS-2B cells. Nucleocapsid and non-structural proteins NSP2, NSP3, NSP5, NSP9 and NSP12 were found to target the greatest number of unique genes at 1339, 1647, 1199, 902, 863, and 865, respectively (FIG. 3B). The large number of genes targeted by the viral proteins is consistent with the non-structural proteins from the replicase (ORF1ab) having a high affinity for its own RNA, though their potential for widespread interaction with host RNA has not been shown previously. The widespread interaction of Nucleocapsid with host RNAs when expressed in isolation is consistent with its capacity for nonspecific RNA binding, whereas its targeting the virus genome during RNA assembly occurs via interaction with the M protein. For comparison, the extensively studied splicing factor RBFOX2 binds to 958 genes in HepG2 cells and 471 genes in K562 cells, the stress granule assembly factor G3BP1 binds to 561 genes in HepG2 cells,
and the histone RNA hairpin-binding protein SLBP binds to 19 genes in K562 (FIG.3B). This suggests that viral proteins have similar capacities for interacting with RNA as endogenous human RBPs. Target genes with at least one significant eCLIP peak also appear highly distinct across the different SARS-CoV-2 proteins (FIG. 3C). Within individual targets, eCLIP reads also display different profiles, example include N eCLIP peak in 3’ UTR of CXCL1, NSP3 peaks found across all exons in DYNCH1, a NSP12 peak in 5’ UTR of TUSC3 and a NSP2 peak in the intronic region upstream of 3’ splice site of NAP1L4 (FIG. 3D). To cross validate the eCLIP findings that SARS-CoV-2 proteins interact with host cell RNAs, a subset of these proteins were validated as RBPs using crosslinking and solid phase purification (CLASP26), which stringently captures crosslinked protein-RNA interactions due to denaturing wash conditions. HEK293T cells transiently expressing NSP1, NSP2, NSP12, and ORF9c followed by pulldown of total RNA showed enrichment of these proteins (FIG. 3E). For comparison, host RNA binding proteins ELAVL1, YTHDC1 and GAPDH were also included as positive controls, and tubulin as negative controls. Furthermore, RNA interactome capture (RIC27) of poly-A RNA (mRNA, lincRNA, and other POLII transcripts) pulldown was performed using an oligo (dT) primer and found that NSP2 and NSP12 were enriched, but not NSP1 (FIG.3F), which mostly enriched ribosomal RNAs in eCLIP. Distinct processes related to viral replication and host response are targeted by the viral proteins as shown by gene ontology (GO) analysis (FIG.3G). Many of the enriched GO terms are related to nucleic acid and protein synthesis, modification and transport, which is consistent with the primary objective of the virus hijacking host resources for its own biosynthesis and replication. Notably, several protein transport processes are enriched, namely SRP-dependent protein targeting to membrane as enriched by NSP6, ORF3a and N, and COPII vesicle budding and targeting from rough ER to Golgi as enriched by NSP12. These may be involved in viral vesicle formation to serve as replication organelles, as found in a number of positive sense RNA viruses. Immune response processes are also enriched, including neutrophil mediated immunity targeted by NSP12 and platelet degranulation targeted by ORF9c. This supports the choice of lung epithelial cells as a model system that express the relevant cytokines for recruiting immune cells. While the enriched GO terms are highly relevant to viral and host response processes, further analysis of binding patterns is needed to determine if there are any functional implications of viral proteins interacting with these genes. To determine if there are sequence features that the viral proteins recognize, sequence logos from 6-mers of eCLIP peaks were generated. While some of the proteins display strong
sequence preferences, most proteins appear to interact more non-specifically. Some motifs resemble enrichments observed for human RBPs, where M, ORF7a and NSP10 appear to favor G-rich or GU rich motifs, and NSP5 has a motif (GNAUG). Other motifs may result from regional binding preferences (FIG.3H), as NSP2 and NSP9 have a strong preference for UC- rich polypyrimidine motifs (p values of 10-96 and 10-41 respectively), which may be a result of their binding to polypyrimidine tracts in intronic regions, whereas N has an AU-rich motif likely because it preferentially binds to 3’ UTR which contain AU-rich elements. NSP3, a large multifunctional protein, appears to coat entire exons and may not have a meaningful sequence motif. NSP12 primarily binds in the 5’ UTR, and a weakly enriched GUCCCG motif that resembles terminal oligopyrimidine (TOP) motifs hints at a possible role in translation perturbation. The systematic interrogation of SARS-CoV-2 protein-host RNA interactions demonstrates that a majority of SARS-CoV-2 viral proteins are RNA binding proteins that target roughly a third of the human transcriptome. This analysis implies that these viral proteins may be involved in perturbing many essential cellular processes of the host. As eCLIP in virus infected cells are limited by IP-grade antibodies, the data obtained from the exogenous expression of individual proteins in BEAS-2B cells was used for systematic analysis of potential functional implications. Example 4 - SARS-CoV-2 proteins upregulate protein expression of target transcripts By examining the regional binding preferences of each SARS-CoV-2 protein, it was found that SARS-CoV-2 proteins are enriched at distinct regions of target mRNAs, which imply different regulatory functions because of the protein-RNA interaction. Aggregating the analysis of all targeted peaks for each SARS-CoV-2 protein identifies RNA regions that are preferentially bound (FIG.4A). Of note, NSP12, ORF3b, ORF7b and ORF9c show the highest proportion of peaks in the 5’ UTR; NSP2, NSP3, NSP6 and NSP14 show the highest proportion of peaks in the coding region (CDS), NSP5, NSP7 and NSP9 display a high proportion of peaks in intronic regions, and N and NSP15 show the largest proportion of peaks in the 3’ UTR. Also, a metagene analysis of read density was performed across all target mRNA transcripts, where each of the 5’ UTR, CDS and 3’ UTR regions in an mRNA are scaled to standardized lengths (FIG.4B). It was found that even though NSP2 has a similar number and proportion of peaks in the CDS as NSP3, it mainly targets the region spanning the 5’ UTR and coding start. In contrast, NSP3 reads, along with that of NSP6 and NSP14, coat the entire CDS, with a slight bias towards the start of the coding sequence.
Since 8 of the SARS-CoV-2 proteins - NSP2, NSP3, NSP6, NSP12, NSP14, ORF3b, ORF7b and ORF9c - have binding preferences at the 5’ UTR and CDS, it was hypothesized that their protein-RNA interactions could affect expression of the target mRNAs at the level of RNA turnover or translation. To evaluate the functional role of the specific protein-RNA interactions of SARS-CoV-2 proteins and target transcripts, 14 of the proteins with the highest number of unique target coding genes were characterized using tethered function reporter assays (FIG.4C). The individual proteins were fused with an MS2 phage coat protein (MCP), which localizes the tagged protein to MS2 aptamer hairpins inserted in the 3’ UTR of Renilla luciferase. A firefly luciferase without MS2 hairpins is included as a control for non-specific effects of the viral protein. Plasmids encoding the MCP-tagged proteins and reporter constructs were co-transfected into HEK293T cells. Changes in Renilla luciferase activity normalized to firefly luciferase activity measures up- or downregulation of protein expression via either translation or mRNA stability because of positioning the MCP tagged protein in the vicinity of the Renilla mRNA. The luciferase readout does not by itself distinguish between translational or mRNA stabilizing effects. From the tethering experiments, it was found that the ratio of Renilla-MS2 to firefly luciferase for 9 of the 14 SARS-CoV-2 proteins increase 1.9 (NSP6) to 3.5-fold (ORF9c) relative to FLAG-MCP control (p-value < 0.002, two tailed multiple t-test) (FIG. 4D). Interestingly, these SARS-CoV-2 proteins raise the target luciferase activity to greater extent than the tethering of BOLL (1.5-fold), which is a human RBP previously characterized to be amongst the strongest up-regulators from a screen of more than 700 human RBPs. Even though NSP1 eCLIP enriched very few host mRNAs and its peaks are not mapped to the 5’ UTR and CDS, these results for NSP1 are consistent with its ability to enhance the translation of its own mRNA via interacting with the 5’ UTR of the genomic viral mRNA. Of the remaining 5 SARS-CoV-2 proteins, only NSP5, NSP16 and N display slight (but not significant) down-regulation effects (0.73-fold to 0.58-fold) compared to the FLAG peptide control, but to a lesser extent than that of the known translation repressor CNOT7 (0.16-fold). NSP7 and NSP9 appear to have no effect on the relative luciferase activity of the target Renilla reporter. To understand if the increase in luciferase activity is occurring at the RNA or protein level, RT-qPCR was performed to measure the ratio of Renilla-MS2 to Firefly mRNAs. For all the proteins except for NSP2, the Renilla-MS2/Firefly mRNA ratio is significantly increased (p<0.05) compared to wildtype, albeit to different extents for different proteins (FIG.4E). Of note, ORF9c shows the greatest enhancing effect (3.5-fold) in the dual luciferase assay, but its effect on the reporter RNAs is
middling (1.5-fold). However, ORF9c displays the greatest fold change in luciferase activity ratio to RNA ratio (2.3-fold) (FIG. 4F), followed by NSP2 and ORF3b (1.6 and 1.7 fold respectively). The rest of the proteins range from 1.1-fold (NSP6) to 1.5-fold (NSP14), compared to 1.0-fold of BOLL, suggesting that the increase in abundance of the targeted reporter likely occurs at both the RNA and protein levels. Based on the results of the reporter assay, it was hypothesized that SARS-CoV-2 proteins that interact with the 5’ UTR and CDS of target genes could increase their abundance. Since NSP12 demonstrated targeted increase of reporter mRNA levels, NSP12 was transiently overexpressed and mRNA sequencing was performed to determine if there are transcriptome- wide changes in gene expression. By comparing HEK293T cells transfected with NSP12 versus cells transfected with an empty plasmid vector, it was observed that the eCLIP targets of NSP12 have greater log2 fold changes in mRNA levels than genes that are not eCLIP targets of any SARS-CoV-2 protein (p<10-13, KS test; FIG.4G). Genes in the enriched GO processes, such as mitochondrial ATP synthesis and transport, protein N-linked glycosylation and COP II vesicle budding, are similarly upregulated by the overexpression of NSP12 (FIG.4H). These observations provide support for the hypothesis that NSP12 has the capacity to increase the abundance of its eCLIP target RNAs. To determine if SARS-CoV-2 proteins enhance the translation of endogenous genes, polysome profiling was performed on ORF9c, as it demonstrated the greatest ratio of changes in luciferase activity to changes in luciferase mRNA levels. First, the log2 fold changes of polysomal mRNA levels versus total mRNA levels were determined in BEAS-2B cells overexpressing ORF9c and then compared it to wildtype BEAS-2B cells to obtain differential translation rates Δlog2FoldChange. It was observed that the eCLIP targets of ORF9c have higher Δlog2FoldChange in translation rates than genes that are not eCLIP targets of any SARS-CoV-2 protein (p <10-3, KS test; FIG. 4I). Genes in the enriched pathways, such as protein processing in the ER, androgen receptor signaling, and protein N-linked glycosylation are similarly upregulated by the overexpression of ORF9c (FIG. 4J). Among the N-linked glycosylated GO term genes, Ribophorin I (RPN1) is part of an N-oligosaccharyl transferase complex that links high mannose oligosaccharides to asparagine residues found in the Asn-X- Ser/Thr consensus motif of nascent polypeptide chains, and UDP-Glucose Glycoprotein Glucosyltransferase 1 (UGGT1) is a soluble protein of the endoplasmic reticulum (ER) that selectively glucosylates unfolded glycoproteins. Represented in the mitochondrial ATP synthesis coupled electron transport and the respiratory electron transport chain GO processes, NDUFA4 is part of the enzyme cytochrome-c oxidase (or complex IV) and is important for its
activity and biogenesis. Consistent with the data showing that exogenous expression of ORF9c can interact with RPNI, UGGT1 and NDUFA4 RNA and increase protein expression, it was found that SARS-CoV-2 infection increases RPNI, UGGT1 and NDUFA4 protein levels specifically in infected cells (FIGs.4H-4I). To determine if some of these host RNAs that interact with expression enhancing SARSCoV-2 proteins are pro-viral or antiviral, the impact of siRNA knockdown of these genes was investigated on viral infection or replication in human lung organoid cells (FIGs.6A-6E). Human lung organoids are a physiologically relevant system to study infections and have been shown to be highly infectible by SARS-CoV-2. siRNAs were selected from the target mRNAs of SARSCoV-2 proteins with mRNA stabilization or translation enhancing activities, in addition to an anti-ACE2 siRNA and a scrambled sequence as a negative control. Assays were performed for infected cells by immunofluorescence and infection rate was determined by measuring the total integrated fluorescence intensity of the stained nucleocapsid protein (FIG. 4K). To control for cell viability, the integrated intensity was divided to the area stained by DAPI, and the values were normalized to the scrambled control (FIG. 4L). It was found that siRNA knockdown of RPN1, UGGT1, NDUFA4, HSPA5, PSMD13, LAPTM4A, LAMP1, and LDHB significantly (p<0.05, two-tailed t-test) reduced infection rates for at least one of the tested SARS-CoV-2 variants compared to a scrambled siRNA control (FIG.4L). Of note, siRNA knockdown of NDUFA4, UGGT1 and LAPTM4A significantly reduced viral infection in all three variants. Taken together, these results suggest that SARS-CoV-2 proteins with a preference for binding to 5’ UTR and CDS regions have a capacity for increasing the abundance of target mRNAs and/or translation rates. Furthermore, it was found that eCLIP target genes are associated with enhanced RNA levels via NSP12 overexpression, and increased translation rates with ORF9c overexpression. Example 5 - NSP9 associates with the nuclear pore to block mRNA export Since it was reported that several SARS-CoV-2 proteins are localized to the cell nucleus, it was found that the eCLIP peaks of NSP2, NSP5, NSP7, and NSP9 are enriched in intronic regions (FIG.5A). To test whether these targets are implicated in infection induced alternative splicing, deep sequencing (>50 million 100 nt reads per sample) of SARS-CoV-2 infected A549-ACE2 cells was performed. A total of 1839 alternatively spliced genes were found across all five types of alternative splicing events i.e. alternative 5’ and 3’ splice site, skipped exons, skipped introns and mutually exclusive exons (false discovery rate < 0.1, Inclusion level
difference > 0.05). By comparing genes with eCLIP peaks mapped to intronic regions or splice sites to genes not targeted by any of the SARS-CoV-2 proteins, observed no significant differences were observed in alternative splicing (significance level α = 0.01, KS test). The little or no relationship with splicing led to consideration of other potential ways intronic binding by these SARS-CoV-2 proteins may be affecting the host transcriptome. To infer molecular function, the similarity between each of 223 ENCODE RBP datasets were compared with NSP2, NSP5, NSP7, and NSP9 by computing the Jaccard Index of target genes. It was found that U2AF2’s target gene set is most similar to NSP2, NSP7, and NSP9, and ranks highly for NSP5 (FIG. 5B). However, since the cell lines used for ENCODE – HepG2 and K562 – and in this study – BEAS-2B – are different, the Jaccard indexes are low, at 0.050 for NSP7, 0.054 for NSP9, 0.057 for NSP5, and 0.074 for NSP2. To further ascertain similarity with U2AF2 protein-RNA interactions, the positional read density was inspected. The 5’ end of each eCLIP read can be used to approximate the crosslink site where reverse transcription is aborted or truncated when converting protein-bound RNA to cDNA. By taking the mean of the 5’ read truncation density across all target genes, a strong enrichment for the truncation site was observed at a median of 11 nt upstream of the 3’ splice site (FIG. 5C). Furthermore, a strong overlap was observed between U2AF2 eCLIP performed in both HepG2 and K562 cells with NSP2, NSP5, NSP7, and NSP9, with median truncation site at 10 nt upstream of 3’ splice site, providing evidence of substrate similarity. Using affinity mass- spectrometry, it was shown that NSP9 interacts with several nuclear pore complex proteins, including NUP62, NUP214, NUP88, NUP54 and NUP581 (FIG. 5D). It was confirmed that NUP62 indeed co-immunoprecipitated with NSP9. Even though U2AF2 was not found in the protein-protein interaction network of NSP9, it was previously reported to facilitate the binding of nuclear export factor TAP/NXF1 to its mRNA substrates. From these observations, it was hypothesized that NSP9 may interfere with mRNA export by associating with the nuclear pore and interfering with the U2AF/NXF1 complex for RNA substrate recognition (FIG. 5E). The significance of NSP2, NSP5, and NSP7 association with this intronic region may be less clear, and will benefit from future studies for clarification. To determine if NSP9 inhibits mRNA export activity, assays were performed for the mRNA levels of NSP9 target genes in cytosolic and nuclear fractions. Both NSP9 expressing BEAS-2B cells and the wild type BEAS-2B cells were fractionated, followed by RNA extraction and preparation for mRNA sequencing. No difference was observed in log2 fold changes of mRNA levels in NSP9 overexpressing cells versus wildtype cells between NSP9 eCLIP targets and non-targets, which agrees with the observation of lack of perturbation of
target gene expression in the dual reporter assay. However, NSP9 eCLIP targets displayed greater log2 fold change of mRNA levels in the nuclear fraction and lower levels in the cytosolic fraction than non-target genes (FIG. 5E). To validate the sequencing results, individual RT-qPCR were performed on the subcellular fractionated RNAs for individual target mRNAs IL-1α, ANXA2 and UPP1 (FIG.5F), and lower cytosolic to total mRNA ratios were observed in NSP9-expressing versus parental cells, whereas the cytosolic mRNA levels of non- targeted controls MALAT1 and UBC were not significantly lowered (FIG.5G). Even though nuclear RNA fractions were purified at high yields (>1 μg/μl), the RT-qPCR CT values of the target genes were too high (>25 cycles) for accurate quantification. Interleukin 1α (IL-1α) and interleukin 1β (IL-1β) are important inflammatory cytokines constitutively produced in epithelial cells and plays a central role in regulating immune responses, including being a master cytokine in acute lung inflammation. To determine if NSP9 inhibiting the nucleocytoplasmic export of the mRNA of IL-1α has any impact on the production of this cytokine, an ELISA was performed on the growth media of BEAS-2B wild type and NSP9 expressing cells 48 hours after induction by several common cytokines. Interferon α, β and γ resulted in lowered IL-1α levels in NSP9 cells compared to wild type, though tumor necrosis factor alpha (TNFα) resulted in the greatest reduction (~ 30%) (FIG. 5H). The observation of reduced IL-1α produced at different concentrations of TNFα (FIG.5I) was reproduced. In addition, reduced IL-1β produced in NSP9 expressing cells than in wildtype BEAS-2B cells was observed (FIG. 5J). Thus, NSP9 association with the nuclear pore complex proteins aligns with the observation of decreased cytoplasmic abundance of NSP9 target mRNAs, suggesting that NSP9 interaction may directly inhibit nuclear export. Further, NSP9 reduced the production of its target gene IL-1α/β, which suggests that the export inhibition mechanism may be a strategy that SARSCoV-2 employs to dampen inflammatory host response. Taken together, similarities in intronic protein-RNA interactions were observed by nonstructural proteins 2, 5, 7, and 9, which resembles the binding profile of splicing factor U2AF2. It was further shown that NSP9 reduces cellular mRNA export, likely by interfering with U2AF/NXF1 substrate recognition. These findings suggest NSP9 may contribute to the viral effort in suppressing host gene expression.
Claims
WHAT IS CLAIMED IS: 1. A pharmaceutical composition comprising an inhibitory nucleic acid that is complementary to a portion of a target RNA.
2. The composition of claim 1, wherein the target RNA comprises a portion of the SARS-CoV-2 genome.
3. The composition of claim 1, wherein the target RNA comprises a portion of human RNA.
4. The composition of claim 2 or 3, wherein the target RNA comprises human mRNA.
5. The composition of any one of claims 1-4, wherein the target RNA is associated with a portion of the 5’ untranslated region (UTR) of the target RNA.
6. The composition of any one of claims 1-5, wherein the target RNA is associated with a portion of the coding region of the target RNA.
7. The composition of any one of claims 1-6, wherein the target RNA is associated with a portion of the 3’ untranslated region of the target RNA.
8. The composition of any one of claims 1-7, wherein a SARS-CoV-2 protein selected from the group consisting of NSP2, NSP3, NSP5, NSP6, NSP7, NSP9, NSP12, NSP14, NSP15, ORF3b, ORF7b and ORF9c is capable of binding to a portion of the target RNA.
9. The composition of any one of claims 1-8, wherein the target RNA is associated with a gene selected from the group consisting of RPN1, UGGT1, OST4, NDUFA4, HSPA5, PSMD13, LAPTM4A, LAMP1, C1QBP, and LDHB.
10. The composition of any one of claims 1-9, wherein the inhibitory nucleic acid comprises one or more small interfering RNAs (siRNA).
11. The composition of claim 10, wherein the one or more siRNAs comprise at least one sequence selected from the group consisting of SEQ ID NOs: 1-40.
12. The composition of any one of claims 1-11, wherein the inhibitory nucleic acid comprises one or more antisense oligonucleotides.
13. The composition of any one of claims 1-11, wherein the inhibitory nucleic acid comprises one or more locked nucleic acids (LNA).
14. The composition of claim 13, wherein the one or more LNAs comprises a sequence selected from the group consisting of SEQ ID NOs: 52 and 53.
15. The composition of any one of claims 13-14, wherein the inhibitory nucleic acid comprises one or more phosphorothioate substitutions.
16. The composition of any of claims 13-15, wherein the inhibitory nucleic acid comprises one or more 2’-O-methyl modifications.
17. The composition of any one of claims 13-16, wherein the inhibitory nucleic acid comprises one or more 2’-O-methoxyethyl modifications.
18. The composition of any one of claims 13-17, wherein the inhibitory nucleic acid comprises one or more 2’-fluoro modifications.
19. The composition of any one of claims 1-18, wherein the one or more siRNAs, antisense oligonucleotides, or locked nucleic acids, are a plurality, wherein each siRNA, antisense oligonucleotide, or locked nucleic acid comprises the same sequence as the other siRNAs, antisense oligonucleotides, or locked nucleic acids in the plurality.
20. The composition of any one of claims 1-18, wherein the one or more siRNAs, antisense oligonucleotides, or locked nucleic acids contain a mixture of siRNAs, antisense oligonucleotides, or locked nucleic acids comprising at least two different sequences.
21. The composition of any of claims 13-20, wherein the LNA is enclosed within a vector.
22. The composition of claim 21, wherein the vector is a lentivirus.
23. The composition of claim 21, wherein the vector is an adenovirus.
24. The composition of claim 21, wherein the vector is an adeno-associated viral vector (AAV).
25. The composition of claim 21, wherein the vector is a liposome.
26. A method of treating Coronavirus disease 2019 (COVID-19) in a subject, the method comprising: administering to the subject a therapeutically effective amount of the pharmaceutical composition of any one of claims 1-25, thereby treating Coronavirus disease 2019 (COVID-19) in the subject.
27. The method of claim 26, wherein the therapeutically effective amount of the pharmaceutical composition decreases target RNA protein expression.
28. A method of reducing risk of SARS-CoV2 infection in a subject, the method comprising: administering to the subject a therapeutically effective amount of the pharmaceutical composition of any one of claims 1-25, thereby reducing risk of SARS-CoV2 infection in the subject.
29. An inhibitory nucleic acid which is or comprises SEQ ID NOs: 1-40, wherein the inhibitory nucleic acid comprises a siRNA.
30. An inhibitory nucleic acid which is or comprises SEQ ID NOs: 52 or 53, wherein the inhibitory nucleic acid comprises a LNA.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202163196005P | 2021-06-02 | 2021-06-02 | |
US63/196,005 | 2021-06-02 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2022256409A2 true WO2022256409A2 (en) | 2022-12-08 |
WO2022256409A3 WO2022256409A3 (en) | 2023-01-19 |
Family
ID=84324511
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2022/031773 WO2022256409A2 (en) | 2021-06-02 | 2022-06-01 | Compositions and methods of modulating rna and protein interactions |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2022256409A2 (en) |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2626433B1 (en) * | 2012-02-09 | 2017-04-12 | Max-Planck-Gesellschaft zur Förderung der Wissenschaften e.V. | Method for linking nucleic acids in a microsome from endoplasmatic reticuli |
JP2022544935A (en) * | 2019-08-14 | 2022-10-24 | コディアック バイオサイエンシーズ, インコーポレイテッド | Extracellular vesicles - NLRP3 antagonists |
CN112574960B (en) * | 2020-12-18 | 2022-09-23 | 中国科学院生物物理研究所 | siRNA for efficiently cutting SARS-CoV-2 genome and application thereof |
-
2022
- 2022-06-01 WO PCT/US2022/031773 patent/WO2022256409A2/en unknown
Also Published As
Publication number | Publication date |
---|---|
WO2022256409A3 (en) | 2023-01-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10093923B2 (en) | Modulation of HSP47 expression | |
JP6072842B2 (en) | Treatment of brain-derived neurotrophic factor (BDNF) -related diseases by suppression of natural antisense transcripts against BDNF | |
JP6066035B2 (en) | Treatment of glial cell-derived neurotrophic factor (GDNF) -related diseases by suppression of natural antisense transcripts against GDNF | |
JP2016531570A (en) | Oligonucleotides targeting the euchromatin region | |
US20150225722A1 (en) | Methods for selective targeting of heterochromatin forming non-coding rna | |
EP2683411B1 (en) | Methods of using microrna-26a to promote angiogenesis | |
US20200054746A1 (en) | Methods for increasing neuronal survival | |
BRPI0617860A2 (en) | small interfering oligonucleotides comprising arabinose-modified nucleotides | |
WO2018160356A1 (en) | Genetic and pharmacological transcriptional upregulation of the repressed fxn gene as a therapeutic strategy for friedreich ataxia | |
WO2022256409A2 (en) | Compositions and methods of modulating rna and protein interactions | |
US10260067B2 (en) | Enhancing dermal wound healing by downregulating microRNA-26a | |
US20210290653A1 (en) | Modulation of lnc05 expression | |
US10378015B2 (en) | Targeting hepatitis B virus (HBV) host factors | |
US20230287427A1 (en) | Inhibition of lncExACT1 to Treat Heart Disease | |
WO2021062251A2 (en) | Targeting rlim to modulate body weight and obesity | |
JP2023530661A (en) | Treatment of left ventricular dysfunction after acute myocardial infarction | |
EP4185693A1 (en) | Combinatorial inhibition of transcription factors for treatment of heart failure | |
CN117940564A (en) | Combination of antisense oligomers | |
AU2015200064A1 (en) | Modulation of hsp47 expression |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 22816774 Country of ref document: EP Kind code of ref document: A2 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |